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POLYPEPTIDES HAVING PHOSPHOLIPASE B ACTIVITY 
AND NUCLEIC ACIDS ENCODING SAME 

5 

Background of the Invention 

Field of the Invention 

The present invention relates to isolated polypeptides having phospholipase B 
10 activity and isolated nucleic acid sequences encoding the polypeptides. The invention 
also relates to nucleic acid constructs, vectors, and host cells comprising the nucleic acid 
sequences as well as methods for producing and using the polypeptides. 

Description of the Related Art 

15 Phospholipases are enzymes that participate in the hydrolysis of phospholipids 

which consist of a glycerol backbone with two fatty acids in an outer (sn-1) and the 
middle (sn-2) positions and esterified with phosphoric acid in the third position. The 
phosphoric acid may, in turn, be esterified to an amino alcohoL 

Several types of phospholipase activity can be distinguished which hydrolyze the 

20 fatty acyl moieties. Phospholipase Al and A2 catalyze the deacylation of one fatty acyl 
group in the sn-1 and sn-2 positions, respectively, from a diacylglycerophosphoKpid to 
produce lysophospholipid. Lysophospholipase (also called phospholipase B by the 
Nomenclature Committee of the International Union of Biochemistry on the 
Nomenclature and Classification of Enzymes {Enzyme Nomenclature, Academic Press, 

25 New York, 1992}) catalyzes the hydrolysis of the remaining fatty acyl group in a 
lysophospholipid. A phospholipase B has been reported from Penicillium notation (Saito 
et aL 9 1991, Methods in Enzymology 197:446-456) which catalyzes the deacylation of 
both fatty acids from a diacylglycerophospholipid and intrinsically possesses 
lysophospholipase activity. 

30 r Fungal enzymes with phospholipase activity have been reported from various 

sources, including Cryptococcus neoformans (Chen et oi, Infection and Immunity 65: 
405-411), Fusobacterium necrophorum fFifis et aL, 1996, Veterinary Microbiology 49: 
219-233), Penicillium notatum (also known as Penicillium chrysogenum; Kawasaki, 
1975, Journal of Biochemistry 77: 1233-1244; Masuda et al., 1991, European Journal of 
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Biochemistry 202: 783-787), Pemczllium cyclopium (Mustranta aL 9 1995, /Vocew 
Biochemistry 30: 393-401), Saccharomyces cerevisia (Ichimasa a/., 1985, 4gr/c BioL 
Chem. 49: 1083-1089; Paultaufe*a/., 1994, Journal of Biological Chemistry 269: 19725- 
19730), Torulaspora delbruecMi (old name Saccharomyces rosei, Kuwabara, 1988, 
> Agria BioL Chem. 52: 2451-2458; Watanabe et al. f 1994, FEMS Microbiological Letters 
124: 29-34), Schuosaccharomyces pombe (Oishi et al., 1996, BioscL Biotech. Biochem. 
60: 1087-1092), Neurospora crassa (Chakravarti et al 9 1981, Archives of Biochemistry 
and Biophysics 206: 393-402), Aspergillus niger (Technical Bulletin, G-zyme™ G6999, 
Enzyme Bio-Systems Ltd.; Mustranta et al. 9 1995, supra), Cortidum centrifugum 
(Uehara et al., 1979, Agria Biol. Chem. 43: 517-525), Fusarium oxysporum (WO 
98/26057), and Fusarium solani (Tsimg-Che al, 1968, Phytopathological Notes 
58:1437-38). 

Fungal phoq>holipase genes have been cloned from several sources including 
Penicillum notatum (Masuda et aL 9 1991, supra), Torulaspora delbrueckii (Watanabe et aL, 
15 1994, FEMS Microbiology Letters 124: 29-34), Saccharomyces cerevisia? (Lee at al. 9 
1994, Journal of Biological Chemistry 269: 19725-19730), Aspergillus (JP 10155493), 
Neurospora crassa (EMBL 042791), and Schizosaccharomyces pombe (EMBL 
013857). 

It is an object of the present invention to provide improved polypeptides having 
20 phospholipase B activity and nucleic acid encoding the polypeptides. 

Summary of the Invention 

The present invention relates to isolated polypeptides having phospholipase B 
25 activity selected from the group consisting of: 

(a) a polypeptide having an amino acid sequence which has at least 65% 
identity with amino acids 20 to 464 of SEQ ID NO. 2; 

(b) a polypeptide encoded by a nucleic acid sequence which hybridizes under 
low stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID NO. 1, (ii) the 

30 cDNA sequence contained in nucleotides 568 to 2045 of JSEQ ID NO. 1, (iii) a 
subsequence of (0 or (ii) of at least 100 nucleotides, or (iv) a complementary strand of 
(iX(n),or(iii); 

(c) a variant of the polypeptide having an amino acid sequence of SEQ ID 
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NO. 2 comprising a substitution, deletion, and/or insertion of one or more amino acids; 

(d) an allelic variant of (a) or (b); and 

(e) a fragment of (a), (b), or (d) that has phospholipase B activity. 

The present invention also relates to isolated nucleic acid sequences encoding the 
polypeptides and to nucleic acid constructs, vectors, and host cells comprising the nucleic 
acid sequences as well as methods for producing and using the polypeptides. 

Brief Description of the Figures 

Figures 1 A and IB show the genomic DNA sequence and the deduced amino add 
sequence of an Aspergillus oryzae phospholipase B (SEQ ID NOS. 1 and 2, 
respectively). 

Detailed Description of the Invention 


Polypeptides Having Phospholipase B Activity 

The terra '^phospholipase B activity" is defined herein as a carboxylic ester 
hydrolysis activity which catalyzes the deacylation of one or both of the fatty acyi groups 
in die sn-1 and sn-2 positions of a diacylglycerophospholipid. For purposes of the 

20 present invention, phospholipase B activity is determined by incubating the 
phospholipase with lysolecithin (or L-a-lysophosphatidylcholine) in the presence of 
calcium chloride at 37°C, pH 7 for 10 minutes and measuring the release of fatty acid 
using any method known in the art such as the NEFA C assay kit (Wako Chemicals, 
Richmond, VA) according to the manufacturer's instructions. One unit of phospholipase 

25 B activity is defined as 1 .0 pinole of free fatty acid produced per minute at 37°C, pH 7.0. 

In a first embodiment, the present invention relates to isolated polypeptides 
having an amino acid sequence which has a degree of identity to amino acids 20 to 464 
of SEQ ID NO. 2 (ile, the mature polypeptide) of at least about 65%, preferably at least 
about 70%, more preferably at least about 80%, even more preferably at least about 90%, 

30 most preferably at least about 95%, and even most preferably at least about 97%, which 
have phospholipase B activity (hereinafter "homologous polypeptides"). In a preferred 
embodiment, the homologous polypeptides have an amino acid sequence which differs 
by five amino acids, preferably by four amino acids, more preferably by three amino 
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acids, even more preferably by two amino acids, and most preferably by one amino acid 
from amino acids 20 to 464 of SEQ ID NO. 2. For purposes of the present invention, the 
degree of identity between two amino acid sequences is determined by the ChistalW 
method using the ClustalX software (Thompson et al; 9 1994, Nucleic Adds Research 22: 
5 4673-4680; Thompson et aL, 1997, Nucleic Acids Research 25: 4876-4882) with a 
blosnm weight matrix, gap opening penalty of 10, and gap extension penalty of 0.03 to 
0.05. 

Preferably, the polypeptides of the present invention comprise the amino acid 
sequence of SEQ ID NO. 2 or an allelic variant thereof, or a fragment thereof that has 

10 phosphohpase B activity. In a more preferred embodiment, the polypeptide of the 
present invention comprises the amino acid sequence of SEQ ID NO. 2. In another 
preferred embodiment, the polypeptide of the present invention comprises amino acids 
20 to 464 of SEQ ID NO. 2, or an allelic variant thereof or a fragment thereof that has 
phosphohpase B activity. In another preferred embodiment, the polypeptide of the 

15 present invention comprises amino acids 20 to 464 of SEQ ID NO. 2. In another 
preferred embodiment, the polypeptide of the present invention consists of the amino 
acid sequence of SEQ ID NO. 2 or an allelic variant thereof; or a fragment thereof that 
has phosphohpase B activity. In another preferred embodiment, the polypeptide of the 
present invention consists of the amino acid sequence of SEQ ID NO. 2. In another 

20 preferred embodiment, the polypeptide consists of amino acids 20 to 464 of SEQ ID NO. 
2 or an allelic variant thereof; or a fragment thereof that has phosphohpase B activity. In 
another preferred embodiment, the polypeptide consists of amino acids 20 to 464 of SEQ 
ID NO. 2. 

A fragment of SEQ ID NO. 2 is a polypeptide having one or more amino acids 
25 deleted from the amino and/or carboxyl terminus of this amino acid sequence. 
Preferably, a fragment contains at least 375 amino acid residues, more preferably at least 
400 ammo acid residues, and most preferably at least 425 amino acid residues. 

An allelic variant denotes any of two or more alternative forms of a gene 
occupying the same chromosomal locus. Allelic variation arises naturally through 
JO mutation, and may result in polymorphism within populations. Gepe mutations can be 
silent (no change in the encoded polypeptide) or may encode polypeptides having altered 
amino acid sequences. An allelic variant of a polypeptide is a polypeptide encoded by an 
allelic variant of a gene. 


WO 01/29222 


PCT/USOO/29140 


In a second embodiment, the present invention, relates to isolated polypeptides 
having phospholipase B activity which are encoded by nucleic acid sequences which 
hybridize under very low stringency conditions, preferably low stringency conditions, 
more preferably medium stringency conditions, more preferably medium-high stringency 
5 conditions, even more preferably high stringency conditions, and most preferably very 
high stringency conditions with a nucleic acid probe which hybridizes under the same 
conditions with Q) nucleotides 568 to 2045 of SEQ ID NO. 1, (ii) the cDNA sequence 
contained in nucleotides 568 to 2045 of SEQ ID NO. 1, (iii) a subsequence of© or (ii), 
or (iv) a complementary strand of (i), (ii), or (iii) (I Sambrook, EJF. Fritsch, and T. 

10 Maniatus, 1989, Molecular Cloning, A Laboratory Manual* 2d edition, Cold Spring 
Harbor, New York). The subsequence of SEQ ID NO. 1 may be at least 100 nucleotides 
or preferably at least 200 nucleotides. Moreover, the subsequence may encode a 
polypeptide fragment which has phospholipase B activity. The polypeptides may also be 
allelic variants or fragments of the polypeptides that have phospholipase B activity. 
* 15 The nucleic acid sequence of SEQ ID NO. 1 or a subsequence thereof, as well as 

the amino acid sequence of SEQ ID NO. 2 or a fragment thereof, may be used to design a 
nucleic acid probe to identify and clone DNA encoding polypeptides having 
phospholipase B activity from strains of different genera or species according to methods 
well known in the art. In particular, such probes can be used for hybridization with the 

20 genomic or cDNA of the genus or species of interest, following standard Southern 
blotting procedures, in order to identify and isolate the corresponding gene therein. Such 
probes can be considerably shorter than the entire sequence, but should be at least 15, 
preferably at least 25, and more preferably at least 35 nucleotides in length. Longer 
probes can also be used. Both DNA and RNA probes can be used. The probes are 

25 typically labeled for detecting the corresponding gene (for example, with 32 P, *H, 35 S, 
biotin, or avidin). Such probes are encompassed by the present invention. 

Thus, a genomic DNA or cDNA library prepared from such other organisms may 
be screened for DNA which hybridizes with the probes described above and which 
encodes a polypeptide having phospholipase B activity. Genomic or other DNA from 

30 such other organisms may be separated by agarose or polyacrylamide gel electrophoresis, 
or other separation techniques. DNA from the libraries or the separated DNA may be 
transferred to and immobilized on nitrocellulose or other suitable carrier material. In 
order to identify a clone or DNA which is homologous with SEQ ID NO. 1 or a 
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subsequence thereof the carrier material is used in a Southern blot For purposes of the 
present invention, hybridization indicates that the nucleic acid sequence hybridizes to a 
labeled nucleic acid probe corresponding to the nucleic acid sequence shown in SEQ ID 
NO. 1, its complementary strand, or a subsequence thereof under very low to very high 
5 stringency conditions. Molecules to which the nucleic acid probe hybridizes under these 
conditions axe detected using X-ray film. 

In a preferred embodiment, the nucleic acid probe is a nucleic acid sequence 
which encodes the polypeptide of SEQ ID NO. 2, or a subsequence thereof In another 
preferred embodiment, the nucleic acid probe is SEQ ID NO. 1. In another preferred 

10 embodiment, the nucleic acid probe is the mature polypeptide coding region of SEQ ID 
NO. 1. In another preferred embodiment, the nucleic acid probe is the nucleic acid 
sequence contained in plasmid pPH6 which is contained in Escherichia coli NRRL B- 
30142, wherein the nucleic acid sequence encodes a polypeptide having acid phosphatase 
activity. In another preferred embodiment, the nucleic acid probe is the mature 

15 polypeptide coding region contained in plasmid pPH6 which is contained in Escherichia 
co/i'NRRLB-30142. 

For long probes of at least 100 nucleotides in length, very low to very higih 
stringency conditions are defined as prehybridization and hybridization at 42°C in 5X 
SSPE, 0.3% SDS, 200 ^lg/ml sheared and denatured salmon sperm DNA, and either 25% 

20 formamide for very low and low stringencies, 35% formamide for medium and medium- 
high stringencies, or 50% formamide for high and very high stringencies, following 
standard Southern blotting procedures. 

For long probes of at least 100 nucleotides in length, the carrier material is finally 
washed three times each for 15 minutes using 2 x SSC, 0.2% SDS preferably at least at 

25 45°C (very low stringency), more preferably at least at 50°C (low stringency), more 
preferably at least at 55°C (medium stringency), more preferably at least at 60°C 
(medium-high stringency), even more preferably at least at 65°C (high stringency), and 
most preferably at least at 70°C (vary high stringency). 

For short probes which are about 15 nucleotides to about 70 nucleotides in length, 

30 stringency conditions are defined as prehybridization, hybridization, and washing post- 
hybridization at about 5°C to about 10°C below the calculated T„ using the calculation 
according to Bolton and McCarthy (1962, Proceedings of the National Academy of 
Sciences USA 48:1390) in 0.9 M NaCl, 0.09 M Tris-HCl pH 7.6, 6 mM EDTA, 0.5% 
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NP-40, IX Denhardt's solution, 1 mM sodium pyrophosphate, 1 mM sodium monobasic 
phosphate, 0.1 mM ATP, and 02 mg of yeast RNA per ml following standard Southern 
blotting procedures. 

For short probes which are about 15 nucleotides to about 70 nucleotides in length, 
5 the carrier material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice 
each for 15 minutes using 6X SSC at 5°C to 10°C below the calculated 

Jn a third embodiment, the present invention relates to variants of the polypeptide 
having an amino acid sequence of SEQ ID NO. 2 comprising a substitution, deletion, 
and/or insertion of one or more amino acids! 

10 The amino acid sequences of the variant polypeptides may differ from the amino 

acid sequence of SEQ ID NO. 2 or the mature polypeptide thereof by an insertion or 
deletion of one or more amino acid residues and/or the substitution of one or more amino 
acid residues by different amino acid residues. Preferably, amino acid changes are of a 
minor nature, that is conservative amino acid substitutions that do not significantly affect 

15 the folding and/or activity of the protein; small deletions, typically of one to about 30 
amino acids; small amino- or caiboxyl-terminal extensions, such as an amino-terminal 
methionine residue; a small linker peptide of up to about 20^25 residues; or a small 
extension that facilitates purification by changing net charge or another function, such as 
a poly-histidine tract, an antigenic epitope or a binding domain. 

20 Examples of conservative substitutions are within the group of basic amino acids 

(arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), 
polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, 
isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), 
and small amino acids (glycine, alanine, serine, threonine and methionine). Amino acid 

25 substitutions which do not generally alter the specific activity are known in the art and 
are described, for example, by H. Neurath and RX. Hill, 1979, In, The Proteins, 
Academic Press, New York. The most commonly occurring exchanges are Ala/Ser, 
Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyi/Phe, 
Ala/Pro, Lys/Arg, Asp/Asn, Leu/De, Leu/Val, Ala/Glu, and Asp/Gly as well as these in 

30 reverse. 

In a fourth embodiment, the present invention relates to isolated polypeptides 
having immunochemical identity or partial immunochemical identity to the polypeptide 
having the amino acid sequence of SEQ ID NO. 2 or the mature polypeptide thereof. The 
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immunochemical properties are determined by immunological cross-reaction identity 
tests by the well-known Ouchterlony double immunodiffusion procedure. Specifically, 
an antiserum containing polyclonal antibodies which are immunoreactive or bind to 
epitopes of the polypeptide having the amino acid sequence of SEQ ID NO. 2 or the 
5 mature polypeptide thereof are prepared by immunizing rabbits (or other rodents) 
according to the procedure described by Harboe and Ingild, In NIL Axelsen, J. Krall, 
and B. Weeks, editors, A Manual of Quantitative Immunoelectrophoresis y Blackwell 
Scientific Publications, 1973, Chapter 23, or Johnstone and Thorpe, Immunochemistry in 
Practice, Blackwell Scientific Publications, 1982 (more specifically pages 27-31). A 

10 polypeptide having immunochemical identity is a polypeptide which reacts with the 
antiserum in an identical fashion such as total fusion of precipitates, identical precipitate 
morphology, and/or identical electrophoretic mobility using a specific immunochemical 
technique. A further explanation of immunochemical identity is described by Axelsen, 
Bock, and Krall, In NJI. Axelsen, J. Krall, and B. Weeks, editors, A Manual of 

15 Quantitative Immunoelectrophoresis, Blackwell Scientific Publications, 1973, Chapter 
10. A polypeptide having partial immunochemical identity is a polypeptide which reacts 
with the antiserum in a partially identical fashion such as partial fusion of precipitates, 
partially identical precipitate morphology, and/or partially identical electrophoretic 
mobility using a specific immunochemical technique. A further explanation of partial 

20 immunochemical identity is described by Bock and Axelsen, In NJI. Axelsen, J. Krall, 
and B. Weeks, editors, A Manual of Quantitative Immunoelectrophoresis, Blackwell 
Scientific Publications, 1 973, Chapter 11. 

The antibody may also be a monoclonal antibody. Monoclonal antibodies may be 
prepared and used, e.g., according to the methods of E. Harlow and D. Lane, editors, 

25 1988, Antibodies, A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, 
New York. 

The polypeptides of the present invention have at least 20%, preferably at least 
40%, more preferably at least 60%, even more preferably at least 80%, even more 
preferably at least 90%, and most preferably at least 100% of the pbospholipase B 
30 activity of the mature polypeptide of SEQ ID NO. 2. 

A polypeptide of the present invention may be obtained from microorganisms of 
any genus. For purposes of the present invention, the term "obtained from" as used 
herein in connection with a given source shall mean that the polypeptide encoded by the 
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nucleic acid sequence is produced by the source or by a cell in which the nucleic acid 
sequence from the source has been inserted. In a preferred embodiment, the polypeptide 
is secreted extracellularly. 

A polypeptide of the present invention may be a bacterial polypeptide. For 
5 example, the polypeptide may be a gram positive bacterial polypeptide such as z Bacillus 
polypeptide, e.g 9 a Bacillus alkalophilus, Bacillus amyloliquefaciens. Bacillus brevis, 
Bacillus circulans, Bacillus coagulans, Bacillus lautus 9 Bacillus lentus, Bacillus 
licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, or 
Bacillus thuringiensis polypeptide; or a Streptomyces polypeptide, eg:, a Streptomyces 

10 lividans or Streptomyces murinus polypeptide; or a gram negative bacterial polypeptide, 
e.g. t an E. coli or a Pseudomonas sp. polypeptide. 

A polypeptide of the present invention may be a fungal polypeptide, and more 
preferably a yeast polypeptide such as a Candida, Kluyveromyces, Pichia, 
Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide; or more preferably a 

15 filamentous fungal polypeptide such as an Acremonium, Aspergillus, Aureobasidium 9 
Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, 
Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, 
Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium 9 or Trichoderma 
polypeptide. 

20 In a preferred embodiment, the polypeptide is a Saccharomyces carlsbergensis, 

Saccharomyces cerevisiae, Saccharomyces diastaticus 9 Saccharomyces douglasii 9 
Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis 
polypeptide. 

In another preferred embodiment, the polypeptide is a Fusarium bactridioides, 
25 Fusarium cerealis y Fusarium crookwellense, Fusarium culmorum 9 Fusarium 
graminearum 9 Fusarium graminum 9 Fusarium heterosporum, Fusarium negundi, 
Fusarium oxysporum y Fusarium reticulatum 9 Fusarium roseum y Fusarium sambucinum, 
Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum 9 Fusarium 
torulosum, Fusarium trichothecioides 9 Fusarium venenatum 9 Humicola insolens, 
30 Humicola lanuginosa, Mucor miehei, Mycettophthora thermophila, Neurospora crassa, 
Penicillium purpurogenum, Trickoderma harzianum, Trichoderma koningii, 
Trickoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride polypeptide. 
In another preferred embodiment, the polypeptide is an Aspergillus aculeatus, 
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Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulatts, 
Aspergillus niger, or Aspergillus oryzae polypeptide. 

In a more preferred embodiment, the polypeptide is an Aspergillus oryzae 
polypeptide, ag., the polypeptide with the amino acid sequence of SEQ ID NO. 2. 
5 It will be understood that for the aforementioned species, the invention 

encompasses both the perfect and imperfect states, and other taxonomic equivalents, eg:, 
anamorphs, regardless of the species name by which they are known. Those skilled in 
the art will readily recognize the identity of appropriate equivalents. For example, 
taxonomic equivalents of Aspergillus are defined by Raper, KJX and Fennel, D.L, 1965, 

10 The Genus Aspergillus, The Wilkins Company, Baltimore. Aspergilli are mitosporic 
fungi characterized by an aspergillum conprised of a conidiospore stipe with no known 
teleomorphic states terminating in a vesicle, which in turn bears one or two layers of 
synchronously formed specialized cells, variously referred to as sterigmata or phialides, 
and asexually formed spores referred to as cqnidia. Known teleomorphs of Aspergillus 

15 include Eurotium, Neosartorya, and Emericella. 

Strains of these species are readily accessible to the public in a number of culture 
collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung 
von Mikroorganismen und Zellkulturen GmbH (DSM), Centraalbureau Voor 
Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, 

20 Northern Regional Research Center (NRRL). 

Furthermore, such polypeptides may be identified and obtained from other 
sources including microorganisms isolated from nature (eg. y soil, composts, water, etc.) 
using the above-mentioned probes. Techniques for isolating microorganisms from 
natural habitats are well known in the art The nucleic acid sequence may then be 

25 derived by similarly screening a genomic or cDNA library of another microorganism. 
Once a nucleic acid sequence encoding a polypeptide has been detected with the 
probe(s), the sequence may be isolated or cloned by utilizing techniques which are 
known to those of ordinary skill in the art (see, e»g. 9 Sambrook et aL y 1989, supra). 

As defined herein, an "isolated** polypeptide is a polypeptide which is essentially 

30 free of other non-phospholipase B polypeptides, ag:, at least about 20% pure, preferably 
at least about 40% pure, more preferably about 60% pure, even more preferably about 
80% pure, most preferably about 90% pure, and even most preferably about 95% pure, as 
determined by SDS-PAGE. 

i 
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Polypeptides encoded by nucleic acid sequences of the present invention also 
include fused polypeptides or cleavable fusion polypeptides in which another polypeptide 
is fused at the N-terminus or the C-terminus of the polypeptide or fragment thereof A 
fused polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) 
5 encoding another polypeptide to a nucleic acid sequence (or a portion thereof) of the 
present invention. Techniques for producing fusion polypeptides are known in the art, 
and include ligating the coding sequences encoding the polypeptides so that they are in 
frame and that expression of the fused polypeptide is under control of the same 
promoters) and terminator. 

10 

Nucleic Acid Sequences 

The present invention also relates to isolated nucleic acid sequences which 
encode a polypeptide of the present invention. In a preferred embodiment, the nucleic 
acid sequence is set forth in SEQ ID NO. 1. In another more preferred embodiment, the 

15 nucleic acid sequence is the sequence contained in plasmid pPH6 that is contained in 
Escherichia coli NRRL B-30142. In another preferred embodiment, the nucleic acid 
sequence is the mature polypeptide coding region of SEQ ID NO. 1. In another more 
preferred embodiment, the nucleic acid sequence is the mature polypeptide coding region 
contained in plasmid pPH6 that is contained in Escherichia coli NRRL B-30142. The 

20 present invention also encompasses nucleic acid sequences which encode a polypeptide 
having the amino acid sequence of SEQ ID NO. 2 or the mature polypeptide thereof, 
which differ from SEQ ID NO. 1 by virtue of the degeneracy of the genetic code. The 
present invention also relates to subsequences of SEQ ID NO. 1 which encode fragments 
of SEQ ID NO. 2 that have phospholipase B activity. 

25 A subsequence of SEQ ID NO. 1 is a nucleic acid sequence encompassed by SEQ 

ID NO. 1 except that one or more nucleotides from the 5' and/or 3' end have been deleted. 
Preferably, a subsequence contains at least 1125 nucleotides, more preferably at least 
1200 nucleotides, and most preferably at least 1275 nucleotides. 

The present invention also relates to mutant nucleic acid sequences comprising at 

30 least one mutation in the mature polypeptide coding sequence of SEQ ID NO. 1, in which 
the mutant nucleic acid sequence encodes a polypeptide which consists of amino acids 20 
to464ofSEQIDN0.2. 

The techniques used to isolate or clone a nucleic acid sequence encoding a 
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polypeptide are known in the art and include isolation from genomic DNA, preparation 
from cDNA, or a combination thereof. The cloning of the nucleic acid sequences of the 
present invention from such genomic DNA can be effected, e.g. 9 by using the well known 
polymerase chain reaction (PGR) or antibody screening of expression libraries to detect 
5 cloned DNA fragments with shared structural features. See, eg., Innis et al, 1990, PCR: 
A Guide to Methods and Application, Academic Press, New York. Other nucleic acid 
amplification procedures such as ligaSe chain reaction (LCR), ligated activated 
transcription (LAI) and nucleic acid sequence-based amplification (NASBA) may be 
used. The nucleic acid sequence may be cloned from a strain of Aspergillus, or another 

10 or related organism and thus, for example, may be an allelic or species variant of the 
polypeptide encoding region of the nucleic acid sequence. 

The term "isolated nucleic acid sequence** as used herein refers to a nucleic acid 
sequence which is essentially free of other nucleic acid sequences, e.g., at least about 
20% pure, preferably at least about 40% pure, more preferably at least about 60% pure, 

15 even more preferably at least about 80% pure, and most preferably at least about 90% 
pure as determined by agarose electrophoresis. For example, an isolated nucleic acid 
sequence can be obtained by standard cloning procedures used in genetic engineering to 
relocate the nucleic acid sequence from its natural location to a different site where it will 
be reproduced. The cloning procedures may involve excision and isolation of a desired 

20 - nucleic acid fragment comprising the nucleic acid sequence encoding the polypeptide, 
insertion of the fragment into a vector molecule, and incorporation of the recombinant 
vector into a host cell where multiple copies or clones of the nucleic acid sequence will 
be replicated. The nucleic acid sequence may be of genomic, cDNA, RNA, 
semisynthetic, synthetic origin, or any combinations thereof. 

25 The present invention also relates to nucleic acid sequences which have a degree 

of homology to the mature polypeptide coding sequence of SEQ ID NO. 1 (i.e., 
nucleotides 568 to 2045) of at least about 65%, preferably about 70%, preferably about 
80%, more preferably about 90%, even more preferably about 95%, and most preferably 
about 97% homology, which encode an active polypeptide. For purposes of the present 

30 invention, the degree of homology between two nucleic acid sequences is determined by 
the Wilbur-Lipman method (Wilbur and Lipman, 1983, Proceedings of the National 
Academy of Science USA 8(h 726-730) using the LASERGENE™ MEGALIGN™ 
software (DNASTAR, Inc., Madison, WI) with an identity table and the following 
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multiple alignment parameters: Gap penalty of 10 and gap length penalty of 10. Pairwise 
alignment parameters were Ktuple=3, gap penalty=3, and windows=20. 

Modification of a nucleic acid sequence encoding a polypeptide of the present 
invention may be necessary for the synthesis of polypeptides substantially similar to the 
5 potypeptide. The term "substantially similar" to the polypeptide refers to non-naturally 
occurring forms of the polypeptide. These polypeptides may differ in some engineered 
way from the polypeptide isolated from its native source, variants that differ in 
specific activity, thermostability, pH optimum, or the like. The variant sequence may be 
constructed on the basis of the nucleic acid sequence presented as the polypeptide 

10 encoding part of SEQ ID NO. 1, e.g. 9 a subsequence thereof, and/or by introduction of 
nucleotide substitutions which do not give rise to another amino acid sequence of the 
polypeptide encoded by the nucleic acid sequence, but which correspond to the codon 
usage of the host organism intended for production of the enzyme, or by introduction of 
nucleotide substitutions which may give rise to a different amino acid sequence. For a 

15 general description of nucleotide substitution, see, e.g. y Ford et a/., 1991, Protein 
Expression and Purification 2: 95-107. 

It will be apparent to those skilled in the art that such substitutions can be made 
outside the regions critical to the function of the molecule and still result in an active 
polypeptide. Amino acid residues essential to the activity of the polypeptide encoded by 

20 the isolated nucleic acid sequence of the invention, and therefore preferably not subject to 
substitution, may be identified according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scarming mutagenesis (see, eg., Cunningham and Wells, 
1989, Science 244: 1081-1085). In the latter technique, mutations are introduced at every 
positively charged residue in the molecule, and the resultant mutant molecules are tested 

25 for phospholipase B activity to identify amino acid residues that are critical to the activity 
of the molecule. Sites of substrate-enzyme interaction can also be determined by analysis 
of the three-dimensional structure as determined by such techniques as nuclear magnetic 
resonance analysis, crystallography or photoaffinity labelling (see, e.g. y de Vos et aL 9 
1992, Science 255: 306-312; Smith et al y 1992, Journal of Molecular Biology 224: 899- 

30 904; Wlodaver et al y 1992, FEB S Letters 309: 59-64). 

The present invention also relates to isolated nucleic acid sequences encoding a 
polypeptide of the present invention, which hybridize under very low stringency 
conditions, preferably low stringency conditions, more preferably medium stringency 
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conditions, more preferably medium-high stringency conditions, even more preferably 
high stringency conditions, and most preferably very high stringency conditions with a 
nucleic acid probe which hybridizes under the same conditions with the nucleic acid 
sequence of SEQ ID NO. 1 or its complementary strand; or allelic variants and 
5 subsequences thereof (Sambrook et al , 1989, supra), as defined herein. 

The present invention also relates to isolated nucleic acid sequences produced by 
(a) hybridizing a DNA under very low, low, medium, medium-high, high, or very high 
stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID NO. 1, (ii) the cDNA 
sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, (iii) a subsequence of 
10 (i) or (ii), or (iv) a complementary strand of (i), (ii), or (iii); and (b) isolating the nucleic 
acid sequence. The subsequence is preferably a sequence of at least 100 nucleotides such 
as a sequence which encodes a polypeptide fragment which has phospholipase B activity. 

Methods for Producing Mutant Nucleic Acid Sequences 

15 The present invention further relates to methods for producing a mutant nucleic 

acid sequence, comprising introducing at least one mutation into the mature polypeptide 
coding sequence of SEQ ID NO. 1 or a subsequence thereof, wherein the mutant nucleic 
acid sequence encodes a polypeptide which consists of amino acids 20 to 464 of SEQ ID 
NO. 2 or a fragment thereof which has phospholipase B activity. 

20 The introduction of a mutation into the nucleic acid sequence to exchange one 

nucleotide for another nucleotide may be accomplished by site-directed mutagenesis 
using any of the methods known in the art Particularly useful is the procedure which 
utilizes a supercoiled, double stranded DNA vector with an insert of interest and two 
synthetic primers containing the desired mutation. The oligonucleotide primers, each 

25 complementary to opposite strands of the vector, extend during temperature cycling by 
means of Pfu DNA polymerase. On incorporation of the primers, a mutated plasmid 
containing staggered nicks is generated. Following temperature cycling, the product is . 
treated with Dpnl which is specific for methylated and hemimethylated DNA to digest 
the parental DNA template and to select for mutation-containing synthesized DNA. 

30 Other procedures known in the art may also be used. 

Nucleic Acid Constructs 

The present invention also relates to nucleic acid constructs comprising a nucleic 
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acid sequence of the present invention operably linked to one or more control sequences 
which direct the expression of the coding sequence in a suitable host cell under 
conditions compatible with the control sequences. Expression will be understood to 
include any step involved in the production of the polypeptide including, but not limited 
5 to, transcription, post-transcriptional modification, translation, post-translational 
modification, and secretion. 

"Nucleic acid construct" is defined herein as a nucleic acid molecule, either 
single- , or double-stranded, which is isolated from a naturally occurring gene or which 
has been modified to contain segments of nucleic acid combined and juxtaposed in a 

10 manner that would not otherwise exist in nature. The term nucleic acid construct is 
synonymous with the term expression cassette when the nucleic acid construct contains 
all the control sequences required. for expression of a coding sequence of the present 
invention. The term "coding sequence'* is defined herein as a nucleic acid sequence 
which directly specifies the amino acid sequence of its protein product The boundaries 

15 of a genomic coding sequence are generally determined by a ribosome binding site 
(prokaryotes) or by the ATG start codon (eukaryotes) located just upstream of the open 
reading frame at the 5* end of the mRNA and a transcription terminator sequence located 
just downstream of the open reading fiame at the 3* end of the mRNA* A coding 
sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleic acid 

20 sequences. 

An isolated nucleic acid sequence encoding a polypeptide of the present invention 
may be manipulated in a variety of ways to provide for expression of the polypeptide. 
Manipulation of the nucleic acid sequence prior to its insertion into a vector may be 
desirable or necessary depending on the expression vector. The techniques for modifying 

25 nucleic acid sequences utilizing recombinant DNA methods are well known in the art 

The term "control sequences'* is defined herein to include all components which 
are necessary or advantageous for the expression of a polypeptide of the present 
invention. Each control sequence may be native or foreign to the nucleic acid sequence 
encoding the polypeptide. Such control sequences include, but are not limited to, a 

30 leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide 
sequence, and transcription terminator. At a minimum, the control sequences include a 
promoter, and transcriptional and translational stop signals. The control sequences may 
be provided with linkers for the purpose of introducing specific restriction sites 
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facilitating ligation of the control sequences with the coding region of the nucleic acid 
sequence encoding a polypeptide. The term "operably linked" is defined herein as a 
configuration in which a control sequence is appropriately placed at a position relative to 
the coding sequence of the DNA sequence such that the control sequence directs the 
5 expression of a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleic acid 
sequence which is recognized by a host cell for expression of the nucleic acid sequence. 
The promoter sequence contains transcriptional control sequences which mediate the 
expression of the polypeptide. The promoter may be any nucleic acid sequence which 

10 shows transcriptional activity in the host cell of choice including mutant, truncated, and 
hybrid promoters, and may be obtained from genes encoding extracellular or intracellular 
polypeptides either homologous or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present invention, especially in a bacterial host cell, are the promoters 

15 obtained from the E. coli lac operon, Streptomyces coettcolor agarase gene (dagA), 
Bacillus subtilis levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene 
(amyL) y Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus 
amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene 
(penP), Bacillus subtilis xylA and xylB genes, and prokaryotic beta-lactamase gene 

20 (Villa-Kamaroff et al y 1978, Proceedings of the National Academy of Sciences USA 75: 
3727-3731), as well as the tac promoter (DeBoer et aL, 1983, Proceedings of the 
National Academy of Sciences USA 80: 21-25). Further promoters are described in 
"Useful proteins from recombinant bacteria" in Scientific American, 1980, 242: 74-94; 
and in Sambrook et aL , 1 989, supra. 

25 Examples of suitable promoters for directing the transcription of the nucleic acid 

constructs of the present invention in a filamentous fungal host cell are promoters 
obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei 
aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable 
alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), 

30 Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose 
phosphate isomerase, Aspergillus nidulans acetamidase, and Fusarium oxysporum 
trypsin-like protease (WO 96/00787), as well as the NA2-tpi promoter (a hybrid of the 
promoters from the genes for Aspergillus niger neutral alpha-amylase and Aspergillus 
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oryzae triose phosphate isomerase), and mutant, truncated, and hybrid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Saccharomyces 
cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), 
Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate 
5 dehydrogenase (ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. 
Other useful promoters for yeast host cells are described by Romanos et aL, 1992, Yeast 
8:423-488. 

The control sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a host cell to terminate transcription. The terminator sequence is 
10 operably linked to the 3* terminus of the nucleic acid sequence encoding the polypeptide. 
Any terminator which is functional in the host cell of choice may be used in the present 
invention. 

Preferred terminators for filamentous fungal host cells are obtained from the 
genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, 
15 Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha-glucosidase, and 
Fusarium oxysporum trypsin-like protease. 

Preferred terminators for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), 
and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful 
20 terminators for yeast host cells are described by Romanos et al> 1 992, supra. 

The control sequence may also be a suitable leader sequence, a nontranslated 
region of an mRNA which is important for translation by the host cell. The leader 
sequence is operably linked to the 5* terminus of the nucleic acid sequence encoding the 
polypeptide. Any leader sequence that is functional in the host cell of choice may be 
25 used in the present invention. 

Preferred leaders for filamentous fungal host cells are obtained from the genes for 
Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase. 

Suitable leaders for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae enolase (ENO- 1), Saccharomyces cerevisiae 3- 
30 phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces 
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase 
(ADH2/GAP). 

The control sequence may also be a polyadenylaiion sequence, a sequence 

-17- 


NZNA-00 13449 


WO 01729222 


PCT/USOO/29140 


operably linked to the 3* terminus of the nucleic acid sequence and which, when 
transcribed, is recognized by the host cell as a signal to add polyadenosine residues to 
transcribed mRNA. Any polyadenylation sequence which is functional in the host cell of 
choice may be used in the present invention. 
5 Preferred polyadenylation sequences for filamentous fungal host cells are 

obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger 
gluco amylase, Aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin- 
like protease, and Aspergillus niger alpha-ghscosidase. 

Useful polyadenylation sequences for yeast host cells are described by Guo and 

10 Sherman, 1995, Molecular Cellular Biology 15: 5983-5990. 

The control sequence may also be a signal peptide coding region that codes for an 
amino acid sequence linked to the amino terminus of a polypeptide and directs the 
encoded polypeptide into the cell's secretory pathway. The 5' end of the coding 
sequence of the nucleic acid sequence may inherently contain a signal peptide coding 

15 region naturally linked in translation reading frame with the segment of the coding region 
which encodes the secreted polypeptide. Alternatively, the 5* end of the coding sequence 
may contain a signal peptide coding region which is foreign to the coding sequence. The 
foreign signal peptide coding region may be required where the coding sequence does not 
naturally contain a signal peptide coding region. Alternatively, the foreign signal peptide 

20 coding region may simply replace the natural signal peptide coding region in order to 
enhance secretion of the polypeptide. However, any signal peptide coding region which 
directs the expressed polypeptide into the secretory pathway of a host cell of choice may 
be used in the present invention. 

Effective signal peptide coding regions for bacterial host cells are the signal 

25 peptide coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic 
amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, 
Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases 
(nprT, nprSy nprM), and Bacillus subtilis prsA. Further signal peptides are described by 
Simonen and Palva, 1993, Microbiological Reviews 57: 109-137. 

*0 Effective signal peptide coding regions for filamentous fungal host cells are the 

signal peptide coding regions obtained from the genes for Aspergillus oryzae TAKA 
amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor 
miehei aspartic proteinase, Humicola insolens cellulase, and Humicola lanuginosa lipase. 
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In a preferred embodiment, the signal peptide coding region is nucleotides 510 to 
567 of SEQ ID NO. 1 which encode amino acids 1 to 1 9 of SEQ ID NO. 2. 

Useful signal peptides for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae alpha-fector and Saccharomyces cerevisiae invertase. Other 
5 useful signal peptide coding regions are described by Romanos et al> 1992, supra. 

The control sequence may also be a propeptide coding region that codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. ' The resultant 
polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). 
A propolypeptide is generally inactive and can be converted to a mature active 
polypeptide by catalytic or autocatalytic cleavage of the propeptide fiom the 
propolypeptide. The propeptide coding region may be obtained from the genes for 
Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), 
Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and 
Myceliophthora thermophila laccase (WO 95/33836). 
15 Where both signal peptide and propeptide regions are present at the amino 

terminus of a polypeptide, the propeptide region is positioned next to the amino terminus 
of a polypeptide and the signal peptide region is positioned next to the amino terminus of 
the propeptide region. 

It may also be desirable to add regulatory sequences which allow the regulation of 
20 the expression of the polypeptide relative to the growth of the host cell Examples of 
regulatory systems are those which cause the expression of the gene to be turned on or 
off in response to a chemical or physical stimulus, including the presence of a regulatory 
compound. Regulatory systems in prokaryotic systems include the lac, tac, and trp 
operator systems. In yeast, the ADH2 system or GAL1 system may be used. In 
25 filamentous fungi, the TAKA alpha-amylase promoter, Aspergillus niger glucoamylase 
promoter, and Aspergillus oryzae glucoamylase promoter may be used as regulatory 
sequences. Other examples of regulatory sequences are those which allow for gene 
amplification. In eukaryotic systems, these include the dihydrofolate reductase gene 
which is amplified in the presence of methotrexate, and the metallothionein genes which 
30 are amplified with heavy metals. In these cases, the nucleic acid sequence encoding the 
polypeptide would be operably linked with the regulatory sequence. 

The present invention also relates to nucleic acid constructs for altering the 
expression of an endogenous gene encoding a polypeptide of the present invention. The 
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constructs may contain the minimal number of components necessary for altering 
expression of the endogenous gene. In one embodiment, the nucleic acid constructs 
preferably contain (a) a targeting sequence, (b) a regulatory sequence, (c) an exon, and 
(d) a splice-donor site. Upon introduction of the nucleic acid construct into a cell, the 
5 construct inserts by homologous recombination into the cellular genome at the 
endogenous gene site. The targeting sequence directs the integration of elements (a>(d) 
into the endogenous gene such that elements (b)-(d) are operably linked to the 
endogenous gene. In another embodiment, the nucleic acid constructs contain (a) a 
targeting sequence, (b) a regulatory sequence, (c) an exon, (d) a splice-donor site, (e) an 
intron, and (f) a splice-acceptor site, wherein the targeting sequence directs the 
integration of elements (a)-(f) such that elements (bHl) are operably linked to the 
endogenous gene. However, the constructs may contain additional components such as a 
selectable marker. 

In both embodiments, the introduction of these components results in production 
15 of a new transcription unit in which expression of the endogenous gene is altered. La 
essence, the new transcription unit is a fusion product of the sequences introduced by the 
targeting constructs and the endogenous gene. In one embodiment in which the 
endogenous gene is altered, the gene is activated. In this embodiment, homologous 
recombination is used to replace, disrupt, or disable the regulatory region normally 
20 associated with the endogenous gene of a parent cell through the insertion of a regulatory 
sequence which causes the gene to be expressed at higher levels than evident in the 
corresponding parent cell. The activated gene can be further amplified by the inclusion 
of an amplifiable selectable marker gene in the construct using methods well known in 
the art (see, for example, U.S. Patent No. 5,641,670). In another embodiment in which 
25 the endogenous gene is altered, expression of the gene is reduced. 

The targeting sequence can be within the endogenous gene, immediately adjacent 
to the gene, within an upstream gene, or upstream of and at a distance from the 
endogenous gene. One or more targeting sequences can be used. For example, a circular 
plasmid or DNA fragment preferably employs a single targeting sequence, while a linear 
30 plasmid or DNA fragment preferably employs two targeting sequences. 

The regulatory sequence of the construct can be comprised of one or more 
promoters, enhancers, scaffold-attachment regions or matrix attachment sites, negative 
regulatory elements, transcription binding sites, or combinations of these sequences. 
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The constructs further contain one or more exons of the endogenous gene. An 
exon is defined as a DNA sequence which is copied into RNA and is present in a mature 
mRNA molecule such that the exon sequence is in-frame with the coding region of the 
endogenous gene. The exons can, optionally, contain DNA which encodes one or more 
5 amino acids and/or partially encodes an amino acid. Alternatively, the exon contains 
DNA which corresponds to a 5' non-encoding region. Where the exogenous exon or 
exons encode one or more amino acids and/or a portion of an amino acid, the nucleic acid 
construct is designed such that, upon transcription and splicing, the reading frame is in- 
. frame with the coding region of the endogenous gene so that the appropriate reading 
1 0 frame of the portion of the mRNA derived from the second exon is unchanged. 

The splice-donor site of the constructs directs the splicing of one exon to another 
exon. Typically, the first exon lies 5* of the second exon, and the splice-donor site 
overlapping and flanking the first exon on its 3' side recognizes a splice-acceptor site 
flanking the second exon on the 5' side of the second exon. A splice-acceptor site, like a 
15 splice-donor site, is a sequence which directs the splicing of one exon to another exon. 
Acting in conjunction with a splice-donor site, the splicing apparatus uses a splice- 
acceptor site to effect the removal of an intron. 

Expression Vectors 

20 The present invention also relates to recombinant expression vectors comprising a 

nucleic acid sequence of the present invention, a promoter, and transcriptional and 
translational stop signals. The various nucleic acid and control sequences described 
above may be joined together to produce a recombinant expression vector which may 
include one or more convenient restriction sites to allow for insertion or substitution of 

25 the nucleic acid sequence encoding the polypeptide at such sites. Alternatively, the 
nucleic acid sequence of the present invention may be expressed by inserting the nucleic 
acid sequence or a nucleic acid construct comprising the sequence into an appropriate 
vector for expression. In creating the expression vector, the coding sequence is located in 
the vector so that the coding sequence is operably linked with the appropriate control 

30 sequences for expression. 

The recombinant expression vector may be any vector (eg:, a plasmid or virus) 
which can be conveniently subjected to recombinant DNA procedures and can bring 
about the expression of the nucleic acid sequence. The choice of the vector will typically 
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depend on the compatibility of the vector with the host cell into which the vector is to be 
introduced. The vectors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, £e, a vector which exists 
as an extrachromosomal entity, the replication of which is independent of chromosomal 
5 replication, eg., a plasmid, an extrachromosomal element, a minichromosome, or an 
artificial chromosome. The vector may contain any means for assuring self-replication. 
Alternatively, the vector may be one which, when introduced into the host cell, is 
integrated into the genome and replicated together with the chromosome(s) into which it 
has been integrated. Furthermore, a single vector or plasmid or two or more vectors or 
10 plasmids which together contain the total DNA to be introduced into the genome of the 
host cell, or a transposon may be used. 

The vectors of the present invention preferably contain one or more selectable 
markers which permit easy selection of transformed cells. A selectable marker is a gene 
the product of which provides for biocide or viral resistance, resistance to heavy metals, 
15 prototrophy to auxotrophs, and the like. Examples of bacterial selectable markers are the 
dal genes from Bacillus subtilis or Bacillus licheniformis, or markers which confer 
antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracycline 
resistance. Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, 
TRP1, and URA3. Selectable markers for use in a filamentous fungal host cell include, 
20 but are not limited to, amdS (acetamidase), argB (ornithine caibamoyltransferase), bar 
(phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD (nitrate 
reductase), pyrG (orotidine-5%phosphate decarboxylase), sC (sulfate adenyltransferase), 
and trpC (anthranilate synthase), as well as equivalents thereof. Preferred for use in an 
Aspergillus cell are the amdS and pyrG genes of Aspergillus mdulans or Aspergillus 
25 oryzae and the bar gene of Streptomyces hygroscopicus. 

The vectors of the present invention preferably contain an elements) that permits 
integration of the vector into the host cell's genome or autonomous replication of the 
vector m the cell independent of the genome. 

For integration into the host cell genome, the vector may rely on the nucleic acid 
30 sequence encoding the polypeptide or any other element of the vector for integration of 
the vector into the genome by homologous or nonhomologous recombination. 
Alternatively, the vector may contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host celL The 
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additional nucleic acid sequences enable the vector to be integrated into the host cell 
genome at a precise location(s) in the chn>mosome(s). To increase the likelihood of 
integration at a precise location, the integrational elements should preferably contain a 
sufficient number of nucleic acids, such as 100 to 10,000 base pairs, preferably 400 to 
5 10,000 base pairs, and most preferably 800 to 10,000 base pairs, which are highly 
homologous with the corresponding target sequence to enhance the probability of 
homologous recombination. The integrational elements may be any sequence that is 
homologous with the target sequence in the genome of the host celL Furthermore, the 
integrational elements may be non-encoding or encoding nucleic acid sequences. On the 

10 other hand, the vector may be integrated into the genome of the host cell by non- 
homologous recombination. 

For autonomous replication, the vector may further comprise an origin of 
replication enabling the vector to replicate autonomously in the host cell in question. 
Examples of bacterial origins of replication are the origins of replication of plasmids 

15 pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and 
pUBl 10, pE194, pTA1060, and pAMfil permitting replication in Bacillus. Examples of 
origins of replication for use in a yeast host cell are the 2 micron origin of replication, 
ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and 
CEN6. The origin of replication may be one having a mutation which makes its 

20 functioning temperature-sensitive in the host cell (see, eg., Ehrlich, 1978, Proceedings of 
the National Academy of Sciences USA 75: 1 433). 

More than one copy of a nucleic acid sequence of the present invention may be 
inserted into the host cell to increase production of the gene product An increase in the 
copy number of the nucleic acid sequence can be obtained by integrating at least one 

25 additional copy of the sequence into the host cell genome or by including an amplifiable 
selectable marker gene with the nucleic acid sequence where cells containing amplified 
copies of the selectable marker gene, and thereby additional copies of the nucleic acid 
sequence, can be selected for by cultivating the cells in the presence of the appropriate 
selectable agent. 

30 The procedures used to ligate the elements described above to construct the 

recombinant expression vectors of the present invention are well known to one skilled in 
the art (see, eg., Sambrook et al y 1989, supra). 
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Host Cells 

The present invention also relates to recombinant host cells, comprising a nucleic 
acid sequence of the invention, which are advantageously used in the recombinant 
production of the polypeptides. A vector comprising a nucleic acid sequence of the 
5 present invention is introduced into a host cell so that the vector is maintained as a 
chromosomal integrant or as a self-replicating extra-chromosomal vector as described 
earlier. The term "host cell" encompasses any progeny of a patent cell that is not 
identical to the parent cell due to mutations that occur during replication. The choice of a 
host cell will to a large extent depend upon the gene encoding the polypeptide and its 
10 source. 

The host cell may be a unicellular microorganism, eg.* a prokaryote, or a non- 
unicellular microorganism, eg., a eukaryote. 

Useful unicellular cells are bacterial cells such as gram positive bacteria 
including, but not limited to, a Bacillus cell, eg., Bacillus alkalophilus, Bacillus 

15 amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus 
coagulans 9 Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, 
Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a 
Streptomyces cell, e.g., Streptomyces lividans and Streptomyces murinus, or gram 
negative bacteria such as R coli and Pseudomonas sp. In a preferred embodiment, the 

20 bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus 
stearothermophilus, or Bacillus subtilis cell. In another preferred embodiment, the 
Bacillus cell is an alkalophilic Bacillus. 

The introduction of a vector into a bacterial host cell may, for instance, be 
effected by protoplast transformation (see, eg., Chang and Cohen, 1979, Molecular 

25 General Genetics 168: 111-115), using competent cells (see, eg., Young and Spizizin, 
1961, Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, 
Journal of Molecular Biology 56: 209-221), electroporation (see, eg., Shigekawa and 
Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, eg., Koehler and Thome, 
1987, Journal of Bacteriology 169: 5771-5278). 

30 The host cell may be a eukaryote, such as a mammalian, insect, plant, or fungal 

celL 

In a preferred embodiment, the host cell is a fungal cell. "Fungi" as used herein 
includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as 
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defined by Hawksworth et aL, In, Ainsworth and Bisby's Dictionary of The Fungi, Sih 
edition, 1995, CAB International, University Press, Cambridge, UK) as well as the 
Oomycota (as cited in Hawksworth et al, 1995, supra, page 171) and all mitosporic 
fungi (Hawksworth et al, 1995, supra). 
5 In a more preferred embodiment, the fungal host cell is a yeast cell. 'YeasT as 

used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, 
and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of 
yeast may change in the future, for the purposes of this invention, yeast shall be defined 
as described in Biology and Activities of Yeast (Skinner, FA., Passmore, SM., and 
10 Davenport, RJL, eds, Sloe. App Bacteriol Symposium Series No. 9, 1980). 

In an even more preferred embodiment, the yeast host cell is a Candida, 
Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia 
cell 

In a most preferred embodiment, the yeast host cell is a Saccharomyces 
15 carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces 
douglasii, Saccharomyces Muyveri, Saccharomyces norbensis or Saccharomyces 
oviformis cell. In another most preferred embodiment, the yeast host cell is a 
Kluyveromyces lactis cell. In another most preferred embodiment, the yeast host cell is a 
Yarrowia lipofytica cell. 

20 In another more preferred embodiment, the fungal host cell is a filamentous 

fungal cell. 'Tilamentous fungi" include all filamentous forms of the subdivision 
Eumycota and Oomycota (as defined by Hawksworth et al, 1995, supra). The 
filamentous fungi are generally characterized by a mycelial wall composed of chitin, 
cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative 

25 growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, 
vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a 
unicellular thalhis and carbon catabolism may be fermentative. 

In an even more preferred embodiment, the filamentous fungal host cell is a cell 
of a species o£ but not limited to, Acremonhan y Aspergillus, Fusarium, Humicola, 

30 Mucor, Myceliophthora, Neurospora, Penidllium, Thielavia, Tolypocladium, or 
Trichoderma. 

In a most preferred embodiment, the filamentous fungal host cell is an Aspergillus 
awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus 
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niger or Aspergillus oryzae cell. In another most preferred embodiment, the filamentous 
fungal host cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, 
Fusarium cuhnorum, Fusarium graminearum, Fusarium gramimim, Fusarium 
heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reficulatum, 
5 Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium 
sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium 
trichothecioides, or Fusarium venenatum cell. In an even most preferred embodiment, 
the filamentous fungal parent cell is a Fusarium venenatum (Nirenberg sp. nov.) cell. In 
another most preferred embodiment, the filamentous fungal host cell is a Humicola 

10 insolens, Humicola lanuginosa, Mucor miehei> Myceliophthora thermophila, Neurospora 
crassa, Penicillium purpurogenum, Thielavia terrestris, Trichoderma harzianum, 
Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or 
Trichoderma viride cell. 

Fungal cells may be transformed by a process involving protoplast formation, 

15 transformation of the protoplasts, and regeneration of the cell wall in a manner known 
per se. Suitable procedures for transformation of Aspergillus host cells are described in 
EP 238 023 and Yelton et al 9 1984, Proceedings of the National Academy of Sciences 
USA 81: 1470-1474. Suitable methods for transforming Fusarium species are described 
by Malardier et aL, 1989, Gene 78: 147-156, and WO 96/00787. Yeast may be 

20 transformed using the procedures described by Becker and Guarente, In Abelson, J.N. 
and Simon, ML, editors, Guide to Yeast Genetics and Molecular Biology, Methods in 
Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et aL, 1983, 
Journal of Bacteriology 153: 163; and Hiimen et aL, 1978, Proceedings of the National 
Academy of Sciences USA 75: 1920. 

25 

Methods of Production 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a strain, which in its wild-type form is 
capable of producing the polypeptide, to produce a supernatant comprising the 
30 polypeptide; and (b) recovering the polypeptide. Preferably, the strain is of the genus 
Aspergillus, and more preferably Aspergillus oryzae. 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a host cell under conditions conducive for 
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production of the polypeptide; and (b) recovering the polypeptide. 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a host cell under conditions conducive for 
production of the polypeptide, wherein the host cell comprises a mutant nucleic acid 
sequence having at least one mutation in the mature polypeptide coding region of SEQ 
ID NO. 1, wherein the mutant nucleic acid sequence encodes a polypeptide which 
consists of amino acids 20 to 464 of SEQ ID NO. 2, and (b) recovering the polypeptide. 

The present invention further relates to methods for producing a polypeptide of 
the present invention comprising (a) cultivating a homologously recombinant cell, having 
incorporated therein a new transcription unit comprising a regulatory sequence, an exon, 
and/or a splice donor site operably linked to a second exon of an endogenous nucleic acid 
sequence encoding the polypeptide, under conditions conducive for production of the 
polypeptide; and (b) recovering the polypeptide. The methods are based on the use of 
gene activation technology, for example, as described in U.S. Patent No. 5,641,670. 

In the production methods of the present invention, the cells are cultivated in a 
nutrient medium suitable for production of the polypeptide using methods known in the 
art For example, the cell may be cultivated by shake flask cultivation, and small-scale or 
large-scale fermentation (including continuous, batch, fed-batch, or solid state 
fermentations) in laboratory or industrial fermentors performed in a suitable medium and 
under conditions allowing the polypeptide to be expressed and/or isolated. The 
cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen 
sources and inorganic salts, using procedures known in the art. Suitable media are 
available from commercial suppliers or may be prepared according to published 
compositions in catalogues of the American Type Culture Collection). If the 
polypeptide is secreted into the nutrient medium, the polypeptide can be recovered 
directly from the medium. If the polypeptide is not secreted, it can be recovered from 
cell ly sates. 

The polypeptides may be detected using methods known in the art that are 
specific for the polypeptides. These detection methods may include use of specific 
antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. 
For example, an enzyme assay may be used to determine the activity of the polypeptide 
as described herein. 

The resulting polypeptide may be recovered by methods known in the art. For 
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example, the polypeptide may be recovered from the nutrient medium by conventional 
procedures including, but not limited to, centrifiigation, filtration, extraction, spray- 
drying, evaporation, or precipitation. 

The polypeptides of the present invention may be purified by a variety of 
5 procedures known in the art including, but not limited to, chromatography (e.g* y ion 
exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic 
procedures (e.g., preparative isoelectric focusing), differential solubility (e.g. 9 ammonium 
sulfate precipitation), SDS-PAGE, or extraction (see, e.g. 9 Protein Purification, J.-C. 
Janson and Lars Ryden, editors, VCH Publishers, New York, 1989). 

10 

Plants 

The present invention also relates to a transgenic plant, plant part, or plant cell 
which has been transformed with a nucleic acid sequence encoding a polypeptide having 
phospholipase B activity of the present invention so as to express and produce the 
1 5 polypeptide in recoverable quantities. The polypeptide may be recovered from the plant 
or plant part. Alternatively, the plant or plant part containing the recombinant 
polypeptide may be used as such for improving the quality of a food or feed, e.g. 9 
improving nutritional value, palatability, and Theological properties, or to destroy an 
antinutritive factor. 

20 The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a 

monocot). Examples of monocot plants are grasses, such as meadow grass (blue grass, 
Poa), forage grass such as festuca, lolium, temperate grass, such as Agrostis, and cereals, 
e.g., wheat, oats, rye, barley, rice, sorghum, and maize (corn). 

Examples of dicot plants are tobacco, legumes, such as lupins, potato, sugar 
25 beet, pea, bean and soybean, and cruciferous plants (family Brassicaceae), such as 
cauliflower, rape seed, and the closely related model organism Arabidopsis thaliana. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. 
Also specific plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, 
peroxisomes, and cytoplasm are considered to be a plant part. Furthermore, any plant 
30 cell, whatever the tissue origin, is considered to be a plant part 

Also included within the scope of the present invention are the progeny of such 
plants, plant parts and plant cells. 

The transgenic plant or plant cell expressing a polypeptide of the present 
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invention may be constructed in accordance with methods known in die art Briefly, the 
plant or plant cell is constructed by incorporating one or more expression constructs 
encoding a polypeptide of the present invention into the plant host genome and 
propagating the resulting modified plant or plant cell into a transgenic plant or plant cell. 
5 Conveniently, the expression construct is a nucleic acid construct which 

comprises a nucleic acid sequence encoding a polypeptide of the present invention 
operably linked with appropriate regulatory sequences required for expression of the 
nucleic acid sequence in the plant or plant part of choice. Furthermore, the expression 
construct may comprise a selectable marker useful for identifying host cells into which 

10 the expression construct has been integrated and DNA sequences necessary for 
introduction of the construct into the plant in question (the latter depends on the DNA 
introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences 
and optionally signal or transit sequences is determined, for example, on the basis of 

15 when, where, and how the polypeptide is desired to be expressed. For instance, the 
expression of the gene encoding a polypeptide of the present invention may be 
constitutive or inducible, or may be developmental, stage or tissue specific, and the gene 
product may be targeted to a specific tissue or plant part such as seeds or leaves. 
Regulatory sequences are, for example, described by Tague et ai, 1988, Plant 

20 Physiology Z6: 506. 

For constitutive expression, the 35S-CaMV promoter may be used (Franck et 
al, 1980, Cell 21 : 285-294). Organ-specific promoters may be, for example, a promoter 
from storage sink tissues such as seals, potato tubers, and fruits (Edwards & Coruzzi, 
1990, Ann. Rev. Genet. 24: 275-303), or from metabolic sink tissues such as meristems 

25 (Ito et al y 1994, Plant Mol Biol 24: 863-878), a seed specific promoter such as the 
glutelin, prolamin, globulin, or albumin promoter from rice (Wu et al. 9 1998, Plant and 
Cell Physiology 39: 885-889), a Vicia faba promoter from the legumin B4 and the 
unknown seed protein gene from Vicia faba (Conrad et aL> 1998, Journal of Plant 
Physiology 152: 708-711), a promoter from a seed oil body protein (Chen et aL, 1998, 

30 Plant and Cell Physiology 39: 935-941), the storage protein napA promoter from 
Brassica napus, or any other seed specific promoter known in the art, e.g., as described in 
WO 91/14772. Furthermore, the promoter may be a leaf specific promoter such as the 
rbcs promoter from rice or tomato (Kyozuka et aL, 1993, Plant Physiology 102: 991- 
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1000, the chlorella virus adenine metfayltransferase gene promoter (Mitra and Higgins, 
1994, Plant Molecular Biology 26: 85-93), or the aldP gene promoter from rice (Kagaya 
et aL 9 1995, Molecular and General Genetics 248: 668-674), or a wound inducible 
promote such as the potato pin2 promoter (Xu et al 9 1993, Plant Molecular Biology 22: 
5 573-588). 

A promoter enhancer element may also be used to achieve higher expression of 
the enzyme in the plant For instance, the promote* enhancer element may be an intron 
which is placed between the promoter and the nucleotide sequence encoding a 
polypeptide of the present invention. For instance, Xu et aL> 1993, supra disclose the use 
0 of the first intron of the rice actin 1 gene to enhance expression. 

The selectable marker gene and any other parts of the expression construct may 
be chosen from those available in the art 

The nucleic acid construct is incorporated into the plant genome according to 
conventional techniques known in the art, including Agrobacterium-medizted 
5 transformation, virus-mediated transformation, microinjection, particle bombardment, 
biolistic transformation, and electroporation (Gasser et al., 1990, Science 244: 1293; 
Potrykus, 1990, Bio/Technology 8: 535; Shimamoto et aL, 1989, Nature 338: 274). 

Presently, Agrobacterhim tumefaciens-medi&ted gene transfer is the method of 
choice for generating transgenic dicots (for a review, see Hooykas and Schilperoort, 
0 1992, Plant Molecular Biology 19: 15-38). However it can also be used for transforming 
monocots, although other transformation methods are generally preferred for these plants. 
Presently, the method of choice for generating transgenic monocots is particle 
bombardment (microscopic gold or tungsten particles coated with the transforming 
DNA) of embryonic calK or developing embryos (Christou, 1992, Plant Journal 2: 275- 
5 281; Shimamoto, 1994; Current Opinion Biotechnology 5: 158-162; Vasil et al., 1992, 
Bio/Technology 10: 667-674). An alternative method for transformation of monocots is 
based on protoplast transformation as described by Qmirulleh et aL, 1993, Plant 
Molecular Biology 21: 415-428. 

Following transformation, the transformants having incorporated therein the 
) expression construct are selected and regenerated into whole plants according to methods 
well-known in the art 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a transgenic plant or a plant cell comprising 
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a nucleic acid sequence encoding a polypeptide having phospholipase B activity of the 
present invention under conditions conducive for production of the polypeptide; and (b) 
recovering the polypeptide. 

5 Removal or Reduction of Phospholipase B Activity 

The present invention also relates to methods for producing a mutant cell of a 
parent cell, which comprises disrupting or deleting a nucleic acid sequence encoding the 
polypeptide or a control sequence thereof which results in the mutant cell producing less 
of the polypeptide than the parent cell when cultivated under the same conditions. 

10 The construction of strains which have reduced phospholipase B activity may be 

conveniently accomplished by modification or inactivation of a nucleic acid sequence 
necessary for expression of the polypeptide having phospholipase B activity in the cell. 
The nucleic acid sequence to be modified or inactivated may be, for example, a nucleic 
acid sequence encoding the polypeptide or a part thereof essential for exhibiting 

15 phospholipase B activity, or the nucleic acid sequence may have a regulatory function 
required for the expression of the polypeptide from the coding sequence of the nucleic 
acid sequence. An example of such a regulatory or control sequence may be a promoter 
sequence or a functional part thereof Le. y a part which is sufficient for affecting 
expression of the polypeptide. Other control sequences for possible modification are 

20 described above. 

Modification or inactivation of the nucleic acid sequence may be performed by 
subjecting die cell to mutagenesis and selecting or screening for cells in which the 
phospholipase B producing capability has been reduced. The mutagenesis, which may be 
specific or random, may be performed, for example, by use of a suitable physical or 

25 chemical mutagenizing agent, by use of a suitable oligonucleotide, or by subjecting the 
DNA sequence to PCR generated mutagenesis. Furthermore, the mutagenesis may be 
performed by use of any combination of these mutagenizing agents. 

Examples of a physical or chemical mutagenizing agent suitable for the present 
purpose include ultraviolet (UV) irradiation, hydroxylamine, N-methyl-K-nitro-N- 

30 nitrosoguanidine (MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane 
sulphonate (EMS), sodium bisulphite, formic acid, and nucleotide analogues. 

When such agents are used, the mutagenesis is typically performed by incubating 
the cell to be mutagenized in the presence of the mutagenizing agent of choice under 
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suitable conditions, and selecting for cells exhibiting reduced phospholipase B activity or 
production. 

Modification or inactivation of production of a polypeptide of the present 
invention may be accomplished by introduction, substitution, or removal of one or more 
5 nucleotides in the nucleic acid sequence encoding the polypeptide or a regulatory element 
required for the transcription or translation thereof. For example, nucleotides may be 
inserted or removed so as to result in the introduction of a stop codon, the removal of the 
start codon, or a change of the open reading frame. Such modification or inactivation 
may be accomplished by site-directed mutagenesis or PCR generated mutagenesis in 

10 accordance with methods known in the art Although, in principle, the modification may 
be performed in vivo, Le. 9 directly on the cell expressing the nucleic acid sequence to be 
modified, it is preferred that the modification be performed in vitro as exemplified below. 

An example of a convenient way to eliminate or reduce production by a host cell 
of choice is by gene replacement or gene interruption. In the gene interruption method, a 

15 nucleic acid sequence corresponding to the endogenous gene or gene fragment of interest 
is mutagenized in vitro to produce a defective nucleic acid sequence which is then 
transformed into the host cell to produce a defective gene. By homologous 
recombination, the defective nucleic acid sequence replaces the endogenous gene or gene 
fragment It may be desirable that the defective gene or gene fragment also encodes a 

20 marker which may be used for selection of transformants in which the gene encoding the 
polypeptide has been modified or destroyed. 

Alternatively, modification or inactivation of the nucleic acid sequence may be 
performed by established anti-sense techniques using a nucleotide sequence 
complementary to the polypeptide encoding sequence. More specifically, production of 

25 the polypeptide by a cell may be reduced or eliminated by introducing a nucleotide 
sequence complementary to the nucleic acid sequence encoding the polypeptide which 
may be transcribed in the cell and is capable of hybridizing to the polypeptide mRNA 
produced in the cell. Under conditions allowing the complementary anti-sense 
nucleotide sequence to hybridize to the polypeptide mRNA, the amount of polypeptide 

30 translated is thus reduced or eliminated. 

It is preferred that the cell to be modified in accordance with the methods of the 
present invention is of microbial origin, for example, a fungal strain which is suitable for 
the production of desired protein products, either homologous or heterologous to the cell. 
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The present invention further relates to a mutant cell of a parent cell which 
comprises a disruption or deletion of a nucleic acid sequence encoding the polypeptide or 
a control sequence thereof; which results in the mutant cell producing less of the 
polypeptide than the parent cell. 
5 The polypeptide-deficient mutant cells so created are particularly useful as host 

cells for the expression of homologous and/or heterologous polypeptides. Therefore, the 
present invention further relates to methods, for producing a homologous or heterologous 
polypeptide comprising (a) cultivating the mutant cell under conditions conducive for 
production of the polypeptide; and (b) recovering the polypeptide. The term 
10 "heterologous polypeptides" is defined herein as polypeptides which are not native to the 
host cell, a native protein in which modifications have been made to altar the native 
sequence, or a native protein whose expression is quantitatively altered as a result of a 
manipulation of the host cell by recombinant DNA techniques. 

In a further aspect, the present invention relates to a method for producing a 
15 protein product essentially free of phospholipase B activity by fermentation of a cell 
which produces both a polypeptide of the present invention as well as the protein product 
of interest by adding an effective amount of an agent capable of inhibiting phospholipase 
B activity to the fermentation broth before, during, or after the fermentation has been 
completed, recovering the product of interest from the fermentation broth, and optionally 
20 subjecting the recovered product to further purification. 

In a further aspect, the present invention relates to a method for producing a 
protein product essentially free of phospholipase B activity by cultivating the cell under 
conditions permitting the expression of the product, subjecting the resultant culture broth 
to a combined pH and temperature treatment so as to reduce the phospholipase B activity 
25 substantially, and recovering the product from the culture broth. Alternatively, the 
combined pH and temperature treatment may be performed on an enzyme preparation 
recovered from the culture broth. The combined pH and temperature treatment may 
optionally be used in combination with a treatment with a phospholipase B inhibitor. 

In accordance with this aspect of the invention, it is possible to remove at least 
30 60%, preferably at least 75%, more preferably at least 85%, still more preferably at least 
95%, and most preferably at least 99% of the phosphqlipase B activity. Complete 
removal of phospholipase B activity may be obtained by use of this method. 

The combined pH and temperature treatment is preferably carried out at a pH in 
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the range of 6.5-8.0 and a temperature in the range of 45-70°C for a sufficient period of 
time to attain the desired effect, where typically, 30 to 60 minutes is sufficient 

The methods used for cultivation and purification of the product of interest may 
be performed by methods known in the art 
5 The methods of the present invention for producing an essentially phospholipase 

B-free product is of particular interest in the production of eukaryotic polypeptides, in 
particular fungal proteins such as enzymes* The enzyme may be selected from, &g., an 
amylolytic enzyme, lipolytic enzyme, proteolytic enzyme, cellulytic enzyme, 
oxidortductase, or plant cell-wall degrading enzyme. Examples of such enzymes include 

10 an aminopeptidase, amylase, amyloglucosidase, carbohydrase, carboxypeptidase, 
catalase, cellulase, chittnase, cutinase, cyclodextrin glycosyltransferase, 
deoxyribonuclease, esterase, galactosidase, beta-gal actosidase, glucoamylase, glucose 
oxidase, glucosidase, haloperoxidase, hemicellulase, invertase, isomerase, laccase, Kgase, 
lipase, lyase, mannosidase, oxidase, pectinolytic enzyme, peroxidase, phytase, 

15 phenoloxidase, polyphenoloxidase, proteolytic enzyme, ribonuclease, transferase, 
transglutaminase, or xylanase. The phospholipase B-deficient cells may also be used to 
express heterologous proteins of pharmaceutical interest such as hormones, growth 
factors, receptors, and the like. 

It will be understood that the term "eukaryotic polypeptides" includes not only 

20 native polypeptides, but also those polypeptides, e.g. t enzymes, which have been 
modified by amino acid substitutions, deletions or additions, or other such modifications 
to enhance activity, thermostability, pH tolerance and the like. 

In a further aspect, the present invention relates to a protein product essentially 
free from phospholipase B activity which is produced by a method of die present 

25 invention. 

Compositions 

In a still further aspect, the present invention relates to compositions comprising a 
polypeptide of the present invention. Preferably, the compositions are enriched in a 
30 polypeptide of the present invention. In the present context, the term "enriched" 
indicates that the phospholipase B activity of the composition has been increased, e.g., 
with an enrichment factor of 1.1. 

The composition may comprise a polypeptide of the invention as the major 
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enzymatic component, a mono-component composition. Alternatively, 'the 
composition may comprise multiple enzymatic activities, such as an aminopeptidase, 
amylase, carbohydrase, carboxypq>tidase v catalase, cellulase, chitinase, cutinase, 
cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, alpha-galactosidase, beta- 
5 galactosidase, glucoamylase, alpha-ghicosidase, beta-glucosidase, haloperoxidase, 
invertase, laccase, lipase, mannosidase, oxidase, pectinolytic enzyme, 
peptidoglutaminase, peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, 
ribonuclease, transglutaminase, or xylanase. The additional enzyme(s) may be produced 
by means of a microorganism belonging to the genus Aspergillus, preferably Aspergillus 

10 aculeatus, Aspergillus awamori, Aspergillus niger, or Aspergillus oryzae, or 
Trichoderma, Humicola, preferably Humicola insolens y or Fusarium, preferably 
Fusarium bactridioides, Fusarium cerealis 9 Fusarium crookwellense, Fusarium 
cuhnorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, 
Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium raseum, 

15 Fusarium sambucinum, Fusarium sarcochroum, Fusarium sulphur eum y Fusarium 
toruloseum, Fusarium trichothecioides, ox Fusarium venenatwn. 

The polypeptide compositions may be prepared in accordance with methods 
known in the art and may be in the form of a liquid or a dry composition. For instance, 
the polypeptide composition may be in the form of a granulate or a mioogranulate. The 

20 polypeptide to be included in the composition may be stabilized in accordance with 
methods known in the art 

Examples are given below of preferred uses of the polypeptide compositions of 
the invention. The dosage of the polypeptide composition of the invention and other 
conditions under which the composition is used may be determined on the basis of 

25 methods known in the art 

Uses 

The present invention is also directed to methods for using the polypeptides 
having phospholipase activity. 
30 The polypeptides of the present invention may be used in any application where it 

is desired to hydrolyze the fatty acyl group(s) of a phospholipid or lysophospholipid, 
such as lecithin or lysolecithin. The polypeptides of the present invention are preferably 
used at a pH optimal for activity. 
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A polypeptide of the present invention may be used for degumming an aqueous 
carbohydrate solution or slurry to improve its filterability, particularly, a starch 
hydrolysate, especially a wheat starch hydrolysate which is difficult to filter and yields 
cloudy filtrates. The treatment may be performed using methods well known in the art 
5 See, for example, EP 219,269 and EP 808,903. 

A polypeptide of the present invention may be used in a process to reduce the 
phospholipid content in an edible oil by treating the oil with the polypeptide to hydrolyze 
a major portion of the phospholipid and separating an aqueous phase containing the 
hydrolyzed phospholipid from the oil. Such a process is applicable to the purification of 
10 any edible oil which contains phospholipid, eg, vegetable oil such as soybean oil, rape 
seed oil, and sunflower oil. 

Prior to the phospholipase treatment, the oil is preferably pretreated to remove 
slime (mucilage), eg., by wet refining. Typically, the oil will contain 50-250 ppm of 
phosphorus as phospholipid at the beginning of the treatment with the phospholipase, and 
15 the treatment may reduce the phosphorus value to below 5-10 ppm. 

The phospholipase treatment is conducted by dispersing an aqueous solution of 
the phospholipase, preferably as droplets with an average diameter below 10 jim. The 
amount of water is preferably 0.5-5% by weight in relation to the oil. An emulsifier may 
optionally be added. Mechanical agitation may be applied to maintain the emulsion. 
20 The phospholipase treatment can be conducted at a pH in the range of about 1 .5 to 

about 5.0; The process pH may be in the range of about 3.5 to about 5 to maximize the 
enzyme's performance, or a pH in the range of about 1.5 to about 3 (eg., 2-3) may be 
used in order to suppress the alkaline hydrolysis of triglycerides (saponification). The 
pH may be adjusted by adding citric acid, a citrate buffer, or hydrochloric acid. 
25 A suitable temperature is generally 30-70°C (particularly 3(M5°C, eg., 35-40°C). 

The reaction time will typically be 1-12 hours (eg., 2-6 hours). A suitable enzyme 
dosage will usually be 0.1-10 mg per liter (eg., 0.5-5 mg per liter). 

The phospholipase treatment may be conducted batchwise, eg., in a tank with 
stirring, or it may be continuous, eg., a series of stirred tank reactors. 
30 The phospholipase treatment is followed by separation of an aqueous phase and 

an oil phase. The separation may be performed by conventional means, eg., 
centrifugation. The aqueous phase will contain phospholipase, and the enzyme may be 
re-used to improve the process economy. 

-36- 


NZNA-00 13468 


WO 01729222 


PCT/US00/29140 


The treatment may be performed using any of the methods known in the ait See, 
for example, U.S. Patent No. 5,264,367, EP 654,527, JP-A 2-153997. 

The present invention also relates to methods for preparing a dough or a baked 
product comprising incorporating into the dough an effective amount of a polypeptide of 
5 the present invention which improves one or more properties of the dough or the baked 
product obtained from the dough relative to a dough or a baked product in which the 
polypeptide is not incorporated. 

The phrase "incorporating into the dough" is defined herein as adding the 
phosphoHpase B to the dough, any ingredient from which the dough is to be made, and/or 
10 any mixture of dough ingredients from which the dough is to be made. In other words, 
the phospholipase B may be added in any step of the dough preparation and may be 
added in one, two, or more steps.. The phospholipase B is added to the ingredients of a 
dough which is kneaded and baked to make the baked product using methods well known 
in the art See, for example, U.S. Patent No/ 4,567,046, EP 426,21 1, JP-A 60-78529, JP- 
15 A 62-1 1 1 629, and JP-A 63-258528. 

The term "effective amount" is defined herein as an amount of the phospholipase 
B that is sufficient for providing a measurable effect on at least one property of interest of 
the dough and/or baked product. 

The term "improved property" is defined herein as any property of a dough and/or 
2Q a product obtained from the dough, particularly a baked product, which is improved by 
the action of the phospholipase B relative to a dough or product in which the 
phospholipase B is not incorporated. The improved property may include, but is not 
limited to, increased strength of the dough, increased elasticity of the dough, increased 
stability of the dough, reduced stickiness of the dough, improved extensibility of the 
25 dough, improved machinability of the dough, increased volume of the baked product, 
improved crumb structure of the baked product, improved softness of the baked product, 
improved flavor of the baked product, and/or improved antistaling of the baked product 

The improved property may be determined by comparison of a dough and/or a 
baked product prepared with and without addition of a polypeptide having phospholipase 
30 B activity of the present invention in accordance with the methods of the present 
invention. Techniques which can be used to determine improvements achieved by use of 
the methods of present invention are described below in the Examples. Organoleptic 
qualities may be evaluated using procedures well established in the baking industry, and 
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may include, for example, the use of a panel of trained taste-testers. 

The term "increased strength of the dough" is defined herein as the property of a 
dough that has generally more elastic properties and/or requires more work input to 
mould and shape. 

5 The term "increased elasticity of the dough" is defined herein as the property of a 

dough which has a higher tendency to regain its original shape after being subjected to a 
certain physical strain. 

The term "increased stability of the dough" is defined herein as the property of a 
dough that is less susceptible to mechanical abuse thus better maintaining its shape and 
10 volume. 

The term "reduced stickiness of the dough" is defined herein as the property of a 
dough that has less tendency to adhere to surfaces, e.g. y in the dough production 
machinery, and is either evaluated empirically by the skilled test baker or measured by 
the use of a texture analyzer (eg., TAXT2) as known in the art. 
15 The term "improved extensibility of the dough** is defined herein as the property 

of a dough that can be subjected to increased strain or stretching without rupture. 

The term "improved machinability of the dough" is defined herein as the property 
of a dough that is generally less sticky and/or more firm and/or more elastic. 

The term "increased volume of the baked product** is measured as the specific 
20 volume of a given loaf of bread (volume/weight) determined typically by the traditional 
rape seed displacement method. 

The term "improved crumb structure of the baked product" is defined herein as 
the property of a baked product with finer and/or thinner cell walls in the crumb and/or 
more uniform/homogenous distribution of cells in the crumb and is usually evaluated 
25 empirically by the skilled test baker. 

The term "improved softness of the baked product" is the opposite of "firmness" 
and is defined herein as the property of a baked product that is more easily compressed 
and is evaluated either empirically by the skilled test baker or measured by the use of a 
texture analyzer (e.g., TAXT2) as known in the art 
30 The term "improved flavor of the baked product" is evaluated by a trained test 

panel. 

The term "improved antistaKng of the baked product" is defined herein as the 
. properties of a baked product that have a reduced rate of deterioration of quality 
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parameters, e.^., softness and/or elasticity, during storage. 

The tenn "dough" is defined herein as a mixture of flour and other ingredients 
finn enough to knead or roll. The dough may be fresh, frozen, pre-bared, or pre-baked. 
The preparation of frozen dough is described by Kulp and Lorenz in Frozen and 
5 Refrigerated Doughs and Batters. 

The term "baked product" is defined herein as any product prepared from a 
dough, either of a soft or a crisp character. Examples of baked products, whether of a 
white, light or dark type, which may be advantageously produced by the present 
invention are bread (in particular white, whole-meal or rye bread), typically in the form 
10 of loaves or rolls, French baguette-type bread, pasta, pita bread, tortillas, tacos, cakes, 
pancakes, biscuits, cookies, pie crusts, steamed bread, and crisp bread, and the like. 

The polypeptide having phospholipase B activity and/or additional enzymes to be 
used in the methods of the present invention may be in any form suitable for the use in 
question, e.g., in the form of a dry powder, agglomerated powder, or granulate, in 
15 particular a non-dusting granulate, liquid, in particular a stabilized liquid, or protected 
enzyme. Granulates and agglomerated powders may be prepared by conventional 
methods, eg., by spraying the phpspholipase B onto a carrier in a fluid-bed granulator. 
The carrier may consist of particulate cores having a suitable particle size. The carrier 
may be soluble or insoluble, eg., a salt (such as NaCl or sodium sulfate), sugar (such as 
20 sucrose or lactose), sugar alcohol (such as sorbitol), starch, rice, com grits, or soy. The 
phospholipase B and/or additional enzymes may be contained in slow-release 
s formulations. Methods for preparing slow-release formulations are well known in the art 
Liquid enzyme preparations may, for instance, be stabilized by adding nutritionally 
acceptable stabilizers such as a sugar, sugar alcohol, or another polyol, and/or lactic acid 
25 or another organic acid according to established methods. 

For inclusion in pre-mixes or flour it is advantageous that the polypeptide having 
phospholipase B activity is in the form of a dry product, eg., a non-dusting granulate, 
whereas for inclusion together with a liquid it is advantageously in a Squid form. 

One or more additional enzymes may also be incorporated into the dough. The 
30 additional enzyme may be of any origin, including mammalian and plant, and preferably 
of microbial (bacterial, yeast or fungal) origin and may be obtained by techniques 
conventionally used in the art 

In a preferred embodiment, the additional enzyme may be an amylase, such as an 
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alpha-amylase (useful for providing sugars fermentable by yeast and retarding staling) or 
beta-amylase, cyclodextrin glucanotransferase, peptidase, in particular, an exopeptidase 
(useful in flavour enhancement), transglutaminase, lipase (useful for the modification of 
lipids present in the dough or dough constituents so as to soften the dough), 
5 phospholipase (useful for the modification of lipids present in the dough or dough 
constituents so as to soften the dough and improve gas retention in the dough), cellulase, 
hemicellulase, in particular a pentosanase such as xylanase (useful for the partial 
hydrolysis of pentosans which increases the extensibility of the dough), protease (useful 
for gluten weakening in particular when using hard wheat flour), protein disulfide 

10 isomerase, e.g., a protein disulfide isomerase as disclosed in WO 95/00636, 
glycosyltransferase, peroxidase (useful for improving the dough consistency), laccase, or 
oxidase, e.g. 9 an aldose oxidase, glucose oxidase, pyranose oxidase, lipoxygenase, or L- 
amino acid oxidase (useful in improving dough consistency). 

When one or more additional enzyme activities are to be added in accordance 

15 with the methods of the present invention, these activities may be added separately or 
together with the polypeptide having phospholipase B activity, optionally as 
constituents) of the bread-improving and/or dough-improving composition. The other 
enzyme activities may be any of the enzymes described above and may be dosed in 
accordance with established baking practices. 

20 The present invention also relates to methods for preparing a baked product, 

comprising baking a dough obtained by a method of the present invention to produce a 
baked product. The baking of the dough to produce a baked product may be performed 
using methods well known in the art 

The present invention also relates to doughs and baked products, respectively, 

25 produced by the methods of the present invention. 

The present invention further relates to a pre-mix, e.g., in the form of a flour 
composition, for dough and/or baked products made from dough, in which the pre-mix 
comprises a polypeptide having phospholipase B activity of the present invention. The 
term "pre-mix" is defined herein to be understood in its conventional meaning, Le., as a 

30 mix of baking agents, generally including flour, which may be used not only in industrial 
bread-baking plants/facilities, but also in retail bakeries. The pre-mix may be prepared 
by mixing the polypeptide or a bread-improving and/or dough-improving composition of 
the invention comprising the polypeptide with a suitable carrier such as flour, starch, a . 
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sugar, or a salt The pre^mix may contain other dough-improving and/or bread- 
improving additives, e.g., any of the additives, including enzymes, mentioned above. 

The present invention further relates to baking additives in the form of a granulate 
or agglomerated powder, which comprise a polypeptide having phospholipase B activity 
5 of the present invention. The baking additive preferably has a narrow particle size 
distribution with more than 95% (by weight) of the particles in the range fiom 25 to 500 
\im. 

Signal Peptide 

10 The present invention also relates to nucleic acid constructs comprising a gene 

encoding a protein opeiably linked to a nucleic acid sequence consisting of nucleotides 
510 to 567 of SEQ ID NO. 1 encoding a signal peptide consisting of amino acids 1 to 19 
of SEQ ID NO. 2, wherein the gene is foreign to the nucleic acid sequence. 

The present invention also relates to recombinant expression vectors and 
1 5 recombinant host cells comprising such a nucleic acid construct. 

The present invention also relates to methods for producing a protein comprising 
(a) cultivating such a recombinant host cell under conditions suitable for production of 
the protein; and (b) recovering the protein. 

The nucleic acid sequence may be operably linked to foreign genes with other 
20 control sequences. Such other control sequences are described above. 

The protein may be native or heterologous to a host cell. The term "protein" is 
not meant herein to refer to a specific length of the encoded product and, therefore, 
encompasses peptides, oligopeptides, and proteins. The term "protein" also 
encompasses two or more polypeptides combined to form the encoded product Hie 
25 proteins also include hybrid polypeptides which comprise a combination of partial or 
complete polypeptide sequences obtained from at least two different proteins wherein 
one or more may be heterologous or native to the host cell. Proteins further include 
naturally occurring allelic and engineered variations of the above mentioned proteins and 
hybrid proteins. 

30 Preferably, the protein is a hormone, hormone variant, enzyme, receptor or a 

portion thereof, antibody or a portion thereof; or reporter. In a more preferred 
embodiment, the protein is an oxidoreductase, transferase, hydrolase, lyase, isomerase, or 
ligase. In an even more preferred embodiment, the protein is an aminopeptidase, 
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amylase, carbohydrase, carboxypeptidase, catalase, celhilase, chitinase, cutinase, 
cyclodextrin glycosyltransferase, deoxyribomiclease, esterase, alpha-galactosidase, beta- 
galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, laccase, 
lipase, mannosidase, mutanase, oxidase, pectolytic enzyme, peroxidase, phytase, 
5 polyphenoloxidase, proteolytic enzyme, ribonuclease, transglutaminase, or xylanase. 

The gene may be obtained fh>m any prokaiyotic, eukaryotic, or other source. For 
purposes of the present invention, the term "obtained from" as used herein in connection 
with a given source shall mean that the protein is produced by the source or by a cell in 
which a gene from the source has been inserted. 

10 

The present invention is further described by the following examples which 
should not be construed as limiting the scope of the invention. 

15 Examples 

Chemicals used as buffers and substrates were commercial products of at least 
reagent grade. 

20 Example 1: Aspergillus oryzae genomic DNA extraction 

Aspergillus oryzae HowB430 (WO 98/1 1203) was grown in 25 ml of 0.5% yeast 
extract-2% glucose (YEG) medium for 24 hours at 37°C and 250 rpm. Mycelia were 
then collected by filtration through Miracloth (Calbiochem, La Jolla, CA) and washed 
once with 25 ml of 10 mM Tris-1 mM EDTA (TE) buffer. Excess buffer was drained 

25 from the mycelia preparation which was subsequently frozen in liquid nitrogen. The 
frozen mycelia preparation was ground to a fine powder in an electric coffee grinder, and 
the powder was added to a disposable plastic centrifuge tube containing 20 ml of TE 
buffer and 5 ml of 20% w/v sodium dodecylsulfate (SDS). The mixture was gently 
inverted several times to ensure mixing, and extracted twice with an equal volume of 

30 phenohchlorofonnrisoamyl alcohol (25:24:1 v/v/v). Sodium acetate (3 M solution) was 
added to the extracted sample to a final concentration of 03 M followed by 2.5 volumes 
of ice cold ethanol to precipitate the DNA. The tube was centrifuged at 1 5,000 x g for 30 
minutes to pellet the DNA. The DNA pellet was allowed to air-dry for 30 minutes before 
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^suspension in 0.5 ml of TE buffer. DNase-free ribonuclease A was added to the 
resuspended DNA pellet to a concentration of 100 fig per ml and the mixture was then 
incubated at 37°C for 30 minutes. Proteinase K (200 jigfail) was added and the tube was 
incubated an additional one hour at 37°C. Finally, the sample was extracted twice with 
5 phenol:chloroform:isoamyl alcohol and the DNA precipitated with ethanoL The 
precipitated DNA was washed with 70% ethanol, dried under vacuum, resuspended in TE 
buffer, and stored at 4°C. 

Example 2: DNA libraries and identification of Aspergillus oryzae pbospholipase B 
10 clones 

Genomic DNA libraries were constructed using the bacteriophage cloning vector 
AZipLox (life Technologies, Gaithersburg, MD) with E. coli Y1090ZL cells (Life 
Technologies, Gaithersburg, MD) as a host for plating and purification of recombinant 
bacteriophage and E. coli DHlOBzip (Life Technologies, Gaithersburg, MD) for excision 

15 of individual pZLl clones containing the phospholipase B gene. 

Aspergillus oryzae HowB430 genomic DNA was partially digested with 7Sp509I 
and size-fractionated on 1% agarose gels. DNA fragments migrating in the size range 3- 
7 kb were excised and eluted from the gel using Prep-a-Gene reagents (BioRad 
Laboratories, Hercules, CA). The eluted DNA fragments were ligated with EcdKL- 

20 cleaved and dephosphoiylated XZipLox vector arms (life Technologies, Gaithersburg, 
MD), and the ligation mixtures were packaged using commercial packaging extracts 
(Stratagene, La Jolla, CA). The packaged DNA libraries were plated and amplified in 
Escherichia coli Y1090ZL cells (Life Technologies, Gaithersburg, MD). 

A genomic clone designated clone 10.2 #10 was inoculated into three ml of LB 

25 plus 50 jig/ml ampicillin medium and grown overnight at 37°C. Mrniprep DNA was 
prepared from this clone using the Bio Robot 9600 (Qiagen Inc., Valencia, CA) 
according to the manufacturer's protocol. The clone was determined by DNA sequencing 
to contain a portion of a phospholipase B gene. DNA sequencing of each clone was 
performed with an Applied Biosystems Prism 377 DNA Sequencer using the BigDye 

30 Terminator Cycle Sequencing Ready Reaction kit (Applied Biosystems, Inc., Foster City, 
CA) according to the manufacturer's instructions. 

In order to recover additional sequence, a XhoVSaR fragment from the insert in 
this clone was used to probe the same genomic library. 
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The Aspergillus oryzae HowB430 DNA library was plated on NZCYM agar 
plates. Plaque lifts (Maniatis el aL, 1982, Molecular Cloning, A Laboratory Manual^ 
Cold Spring Haibor Press, Cold Spring Harbor, New York) were performed on 
approximately 40,000 colonies and the DNA was fixed onto membranes by heating at 80 
5 °C for two hours. The membranes were soaked for 30 minutes at 65°C in a hybridization 
solution containing 6 x SSPE and 7.0% SDS. The phospholipase B gene fragment 
isolated from the Aspergillus oryzae HowB430 as described above was radiolabeled 
using the Prime-It II Random Primer Labeling Kit (Stratagene, La Jolla, CA), denatured 
by adding NaOH to a final concentration of 0.5 M, and added to the hybridization 

10 solution at an activity of approximately 1 x 10 6 cpm per ml of hybridization solution. 
The mixture was incubated overnight at 65°C in a shaking water bath. Following 
incubation, the membranes were washed four times in 0.2 X SSC with 0.2% SDS at 
65°C. The membranes were then dried on blotting paper for 15 minutes, wrapped in 
SaranWrap™, and exposed to X-ray film for 48 hours at -80°C with intensifying screens 

1 5 (Kodak, Rochester, NY). 

Eleven plaques, designated clones 12, 14, 15, 16, 18, 20, 21, 22, 23, 26, and 27, 
produced strong hybridization signals with the probe. These were plaque purified and the 
pZLl plasmids were excised from the purified phagemid clones according to the protocol 
suggested by Life Technologies (Gaithersburg, MD). Colonies were inoculated into 

20 three ml of LB plus 50 figfail ampicillin medium and grown overnight at 37°C. 
Miniprep DNA was prepared from each of these clones using the Qiagen Bio Robot 9600 
according to the manufacturer's protocol. 

DNA sequencing of each clone was performed with an Applied Biosystems Prism 
377 DNA Sequencer using the BigDye Terminator Cycle Sequencing Ready Reaction kit 

25 according to the manufacturer's instructions. Oligonucleotide sequencing primers were 
designed to complementary sequences in the pZLl plasmid vector and were synthesized 
by Operon Technologies Inc., Alameda, CA 

DNA sequencing indicated that the eleven clones contained all of the 
phospholipase B coding sequence (Figure 1). The region spanned by clones 16, 20, 23 

30 and 27 was sequenced to an average redundancy of 5. 

Example 3: PCR amplification of Aspergillus oryzae phospholipase B gene 

The complete phospholipase B coding sequence was amplified by PCR from 
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clone 16.1 #22 using the Expand High Fidelity PCR kit (Roche Molecular Biochemicals, 
Indianapolis, IN) according to the manufecturer's suggested protocol using the following 
primers which incorporated Swal or Pad cloning sites: 

Sense primer. 5 , -ATTTAAATGATGAAGTCCACCGCTCTG-3 , (SEQ ID NO. 3) 
5 Antisense primer S'^STTAATTAAACTTATACCGAAAAGGTATGG-S 1 (SEQ ID NO. 
4) 

Amplification reactions (50 jil) were prepared using approximately 0.2 fig of 
heat-denatured plasmid DNA as the template. Each reaction contained the following 
components: 0.2 ng of heat-denatured plasmid DNA, 15 pmol of the forward primer, 15 

10 pmol of the reverse primer, 250 jiM each ofdATP, dCTP, dGTP, and dTTP, 1 x buffer, 
and 3.5 units of DNA polymerase mixture (Tag and Pwo DNA polymerases). The 
reactions were incubated in a Strategene RoboCycler programmed for 12 cycles each at 
94°C for 45 seconds, 55°C (first cycle) or 57°C for 30 seconds, 72 °C for 120 seconds, 
and 72°C post-cycle for 5 minutes. 

* 5 ^ reaction products were isolated on a 0.8% agarose gel using TAB buffer 

where a 1.6 kb product band was excised from the gel and purified using a QIAquick kit 
(QIAGEN, Chatsworth, CA) according to the manufacturer's instructions. The L6 kb 
fragment was cloned into pCR®2.1-TOPO (Invitrogen, San Diego, CA) after addition of 
3' A-overhangs according to the manufacturer's suggested protocol The insert from one 

20 selected clone was removed by sequential digestion with Pad and Swal, gel purified, and 
ligated into similarly digested pBANelS. The ligation mixture was cloned into £. coli 
DH5a cells to produce E. coli DH5a pPH6. The resulting plasmid containing the full- 
length phospholipase B gene was designated pPH6. 

15 Example 4: DNA sequence analysis of Aspergillus oryzae phospholipase B gene 

DNA sequencing of the phospholipase B gene contained in pPH6 of & coli 
DH5a pPH6 was performed as described in Example 2 using Ml 3 reverse and T7 
oligonucleotide sequencing primers (Operon Technologies, Inc). 

The nucleotide sequence of the gene encoding the Aspergillus oryzae HowB430 
0 phospholipase B and the deduced amino acid sequence thereof is shown in Figure 1 
(SEQ ID NOs. 1 and 2, respectively). Sequence analysis of the cloned insert revealed a 
coding sequence of 1392 bp (excluding the stop codon) encoding a protein of 464 amino 
acids sequence (SEQ ID NO. 2). The coding sequence is punctuated by two introns of 92 
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bp and 51 bp. The G+C content of the coding sequence is 553%. The predicted mature 
phospholipase B polypeptide has a molecular mass of 49.4 kdal and an isoelectric point 
of 4.76. Based on the rules of van Heijne (van Heijne, 1984, Journal of Molecular 
Biology 173: 243-251), the first 19 amino acids likely comprise a secretory signal peptide 
5 which directs the nascent polypeptide into the endoplasmic reticulum. 

A comparative multiple alignment of phospholipase B polypeptide sequences was 
determined by the ClustalW method using the ClustalX software (Thompson et al 9 1994, 
Nucleic Acids Research 22: 4673-4680; Thompson et a/., 1997, Nucleic Adds Research 
25: 4876-4882) with a blosum weight matrix, gap opening penalty of 10, and gap 
10 extension penalty of 0.03 to 0.05. The comparative alignment showed that the 
Aspergillus oryzae HowB430 phospholipase B shares 26% identity with a phospholipase 
C from Pseudomonas aeruginosa (SwissProt Acc. No. P15713). 

Example 5: Expression of Aspergillus oryzae phospholipase B in Aspergillus oryzae 
15 PlasmidpPH6 was digested with Pmel to isolate the phospholipase B expression 

cassette, which was purified by agarose gel electrophoresis and recovered using the 
QIAquick kit. 

The purified fragment was transformed into Aspergillus oryzae strain BECh2 
(Aa§?, tuipl) using the following protoplast transformation method. Protoplasts of 

20 Aspergillus oryzae strain BECh2 were prepared according to the method of Christensen 
et aL y 1988, Bio/Technology 6: 1419-1422. The transformation was conducted with 
protoplasts at a concentration of ca. 2xl0 7 protoplasts per ml. One hundred fj.1 of 
protoplasts were placed on ice for 5 minutes with ca. 2.5 ng of the Pmel fragment; 250 p.1 
of 60% polyethylene glycol 4000, IQmM Tris-HCl, pH 7.5, 10 mM CaCl 2 was added, 

25 and the protoplasts were incubated at 37°C for 30 minutes. Three mis of STC (1*2 M 
soibitol, lOmM Tris-HCl, pH 7.5, and 10 mM CaC^ ) was added. The solution was 
mixed gently and poured onto 150 mm COVE transformation plates (per liter 0.52 g of 
KC1, 0.52 g of MgSO 4 -7H 2 0, 1.52 g of KHjPO,, 1 ml of trace metals described below, 
342 J g of sucrose, 25 g of Noble agar, 10 ml of 1 M acetamide, 10 ml of 3 M CsCl). 

30 The trace metals solution (1000X) was comprised of 22 g of ZnSO^H^O, 11 g of 
H,B0 3 , 5 g of MnCl 2 -4H 2 0, 5 g of FeS0 4 -7H 2 0, 1.6 g of CoOf-SHA 1.6 g of 
(NHJ^oA^ and 50 g of N^gSDTA per liter. Plates were incubated 5-7 days at 34°C 
until colonies appeared. Putative transformants were spore purified twice on COVE- 
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Vogel's medium. Ultimately 18 transfonnants were recovered by their ability to utilize 
acetamide as the sole nitrogen source. All transfonnants were subsequently propagated 
on minimal medium containing 1% glycerol as carbon source. 

For initial small-scale expression analysis, freshly collected spores of the 
5 transfonnants were inoculated in 125 ml baffled shake flasks containing 25 ml of 
medium composed per liter of 50 g of maltodextrin, 2 g of MgS0 4 7H 2 0, lOgofKH^O^ 
2 g of KjSO^ 2 g of citric acid, 10 g of yeast extract, 2 g of urea, 7.1 jig of ZnS0 4 7H 2 0, 
13 >ig of CuS0 4 5HA 025 fig of NiC^fiHA 0.7 fig of FeS0 4 7HA 43 fig of MnSCV 
HjO, and 1.5 fig of citric acid par liter, titrated to pH 6.0 with sodium hydroxide. The 
10 cultures were cultivated for 3-5 days at 34°C with constant shaking at 200 ipm. 

Culture supematants from the transfonnants were submitted to SDS-PAGE using 
8-16% Tris-Glycine SDS-PAGE gels (Novex, San Diego). A doublet band of 
approximately 55-65 kDa was observed in all of the supematants. 

Culture supematants from two of the transfonnants were assayed for 
1 5 phospholipase B activity using 9 mM p-nitrophenylphosphorylcholine as substrate in 0.1 
M MOPS-4 mM CaClj pH 7.5 at 30°C. The assays showed the presence of enzyme 
activity toward p-nitrophenylphosphorylcholine. 

The assay results showed that the transfonnants produced detectable 
phospholipase B activity. 

20 

Example 6: Purification of recombinant phospholipase B 

A volume of 550 ml of whole culture broth obtained as described in Example 5 
was centrifuged at 6000 rpm for 30 minutes. The supernatant of 140 mis was diluted 
with deionized water to 840 mis to a conductivity of 2.3 mS, pH 6.8. The diluted broth 

25 was then filtered through a Millipore Express 0.22 \im filter. 

Q-Sepharose Big Beads (Pharmacia Biotech, Inc., Piscataway, NJ) were loaded 
into an XK-26 column to a volume of approximately 80 ml of resin and pre-equilibrated 
with 500 ml of 20 mM sodium phosphate pH 7.0. The sample was then loaded, followed 
by washing with 20 mM sodium phosphate pH 7.0 until baseline absorbatice at 280 nm 

30 was achieved- A 500 ml gradient was run from 0 to 030 M NaCl in 20 mM sodium 
phosphate pH 7.0 at a flow rate of 5 ml minute for 100 minutes. Fractions of 10 ml were 
collected and assayed using p-mtrophenyl phosphoryl choline (Sigma Chemical Co., St 
Louis, MO) as a substrate at a concentration of 9 mM in 100 mM MOPS-4 mM CaClj pH 
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7.5. Active fractions were also analyzed by SDS-PAGE using 8-16% Tris-Glycine SDS- 
PAGE gels. The purified phospholipase was seen as a doublet at approximately 55-65 
fcDa as with the erode whole broth. 

5 Example 7: Protein sequencing and amino acid analysis of phospholipase B 

N-terminal sequencing of the whole broth and purified phospholipase B obtained 
as described in Example 6 was performed on an Applied Biosystems 476A Protein 
Sequencer (Perkin Elmer/Applied Biosystems Division, Foster City, CA) with on-line 
HPLC and liquid phase trifluoroacetic acid (TFA) delivery. The phospholipase 

10 preparations were submitted to SDS-PAGE using Novex 8-16% Tris-glycine SDS-PAGE 
gels according to manufacturer's suggested conditions (Novex, San Diego, CA). The gels 
were electroblotted to a PYDF membrane (Novex, San Diego, CA) for 2 hours at 25 volts 
in 10 mM CAPS pH 1 1 .0 buffer. The PVDF membrane was stained in 0.1% Commassie 
Blue R250 in 40% methanol/1% acetic acid and the observed bands excised. The excised 

15 bands were sequenced from a blot cartridge using sequencing reagents (Perkin 
Elmer/Applied Biosystems Division, Foster City, CA). Detection of 
phenylthiohydantoin-amino acids was accomplished by on-line HPLC using Buffer A 
containing 3.5% tetrahydrofuran in water with 18 ml of the Prank concentrate (Perkin . 
Elmer/Applied Biosystems Division, Foster City, CA) containing acetic acid, sodium 

20 acetate, and sodium hexanesulfonate and Buffer B containing acetonitrile. Data was 
collected and analyzed on a Macintosh Ilsi using Applied Biosystems 610 Data Analysis 
software. Sequence determinations were made by visualizing chromatograms against a 
light source. 

The major doublet band of the purified phospholipase B was N-terminally 
25 sequenced and found to contain the expected N-terminus SPVTSEYTSVREAPE (SEQ 
ID NO. 2). The doublet band from the whole broth culture was also subjected to N- 
tenninal sequencing and was found to contain the predicted N-tenninns (Ser-Pro-Val- 
Thr-Ser-Glu-Tyr-Thr-Ser-Val-Aig-....), N-terminus - 3 a.a r s (Thr-Ser-Glu-Tyr-Thr-....), 
and N-terminus - 15 a.a's (Gly-Tyr-Lys-Pro-Gly-Ser-....). 

30 

Example 8: Thermostability determination of phospholipase B 

A VP-DSC (MicroCal Inc., Amherst, MA) was employed according to the 
manufacturer's instructions to determine the thermostability of the phospholipase B under 
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various pH conditions. Sample and reference solutions were carefully degassed 
immediately prior to loading of samples into die calorimeter. The reference contained 
buffer with no protein addition. Sample and reference solutions (approx. 0.5 ml) were 
thermally pre-eqmlibrated for 20 minutes at 5°C prior to DSC scanning from 5°C to 95°C 
5 at a rate of 60°C per hour. 

Buffers consisted of 50 mM sodium acetate pH 4,5, 50 mM Tris-acetate pH 7, or 
50 mM borate, pH 9. At pH 4.5 the protein displayed a single Td at 64.8°C; at pH 7 the 
protein presented two transitions at 44.1°C and 55^°C, respectively; and at pH 9 the 
protein transition occurred at 35.5°C. Thus, the protein displayed the highest stability at 
10 pH4.5. 

Example 9: Characterization of phospholipase B activity 

The assay for determination of lecithase and lysolecithase activity employed a 
NEFA C chromogenic assay (Wako Chemicals GmbH) for liberation of free fatty acids. 

15 Lecithin (Avanti Polar Lipids, 95% pure from plants) or lysolecithin (Sigma L4129, egg 
yolk) substrates were prepared at 10 mg per ml of distilled water and then diluted to 1 mg 
per ml in 20 mM HEPES pH 7 buffer. Lecithin dissolution required heating to 55°C to 
prepare a suspension. Hie assay consisted of adding 10 of enzyme solution to 150 \il 
of (he substrate solution. The reaction was incubated for ten minutes at 37°C Following 

20 the incubation period analysis of liberated free fatty acids was performed. The iecithin 
substrate yielded no detectable colored product, but the lysolecithin substrate incubation 
resulted in a slight color. 

A second assay was performed at pH 5 in 50 mM sodium acetate by incubating 
10 nl of enzyme solution with 150 jiLof 1 mg of lyolecithin per ml for one hour at 37°C. 

25 The reaction was stopped by the addition of 1 ml CHCVethyl acetate 1:9 v/v. The 
mixture was centrifuged and the supernatant was analyzed by reverse phase 
chromatography according to the method of Marcato and Cocchin, 1996, J. Chromatogr. 
A730: 83-90, with detection by light scattering. Incubation with lysolecithin resulted in 
the detection of a product peak with a retention time which corresponded to a palmitic 

30 acid standard. 

Deposit of Biological Material 

The following biological material has been deposited under the terms of the 
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Budapest Treaty with the Agricultural Research Service Patent Culture Collection, 
Northern Regional Research Center, 1815 University Street, Peoria, Illinois, 61604, and 
given the following accession number 

Deposit Accession Number Date of Deposit 

5 KcolipPm NRRLB-30142 June 10, 1999 

The strain has been deposited under conditions that assure that access to the 
culture will be available during the pendency of this patent application to one determined 
by the Commissioner of Patents and Trademarks to be entitled thereto under 37 CFJL 
§1.14 and 35 U.S.C. §122. The deposit represents a substantially pure culture of the 
10 deposited strain. The deposit is available as required by foreign patent laws in countries 
wherein counterparts of the subject application, or its progeny are filed. However, it 
should be understood that the availability of a deposit does not constitute a license to 
practice the subject invention in derogation of patent rights granted by governmental 
action. 

15 

The invention described and claimed herein is not to be limited in scope by the 
specific embodiments herein disclosed, since these embodiments are intended as 
illustrations of several aspects of the invention. Any equivalent embodiments are 
intended to be within the scope of this invention. Indeed, various modifications of the 

20 invention in addition to those shown and described herein will become apparent to those 
skilled in the art from the foregoing description. Such modifications are also intended to 
fell within the scope of the appended claims. In the case of conflict, the present 
disclosure including definitions will control. 

Various references are cited herein, the disclosures of which are incorporated by 

25 reference in their entireties. 
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Applicant's or agent* 3 file 
reference number 


595L204-WO 


International abpJ 
To Be Assigned 


INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule 13 bis) 


A. Hie indications made below relate to the microorganism referred to in the description 
on Page 5Q , line ss . r 


B. IDENTIFICATION OF 


Farther deposits are identified on an additional sheet 


Name of depository institution 

Agricultural Research Service Patent Culture Collection (NKRL) 

Address of depository institution (indtidmg postal code and country) 

Northern Regional Research Center 
1815 University Street 
Peoria, IL 6i604,US 


Date of deposit 
10 June 1399 


Accession Number 
NRRL R-30142 


C. ADDSTIOMAL INDICATIONS (lean blank if not anteable) Misinformation is continued on an «W* 


Until the publication of the mention of] 
tiling if the application has been 


rant |of a European iMent or, where applicable, for twenty years from the date of 

rt „L-i A i r j witcdrawn or deemed withdrawn, a sample of the deposited microorganism is 

^Ll^PJ^r ^ to IDde ? en l dent expert nominated by the person requesting the sample ^Ruk^)^K^nd 
fl^A?^ 15 concerned, the expert option is likewise requested, i3erencel)Sng had to Reeu^ion 3 25of 
^^n^MS^? 0 " ^ Ako,forCar^ we re^^onry an in^ 

Commissioner is authorized to have access to a sample of the inicroorganism deposited. ^f"™""™***"*™ 


D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 


E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 


For receiving Office use only 


This sheet wis received whh the international awfication 


Authomed offer J^UL p URROTIA ^/ /( 


form PCT/kO/u* (Jury 1992> 


For International Bureau use only 


This sheet was received with the International Bureau on: 


Authorized officer 
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What is claimed is: 

5 L An isolated polypeptide having phospholipase B activity, selected from the group 
consisting of: 

(a) a polypeptide having an amino acid sequence which has at least 65% 
identity with amino acids 20 to 464 for the mature polypeptide of SEQ ID NO. 2; 

(b) a polypeptide which is encoded by a nucleic acid sequence which 
10 hybridizes under low stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 

NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 
(iii) a subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) a complementary strand 
of(i),(ii), or(iii); 

(c) a variant of the polypeptide having an amino acid sequence of SEQ ID 
15 NO. 2 comprising a substitution, deletion, and/or. insertion of one or more amino acids; 

(d) an allelic variant of (a) or (b); and 

(e) a fragment of (a), (b), or (d) that has phospholipase B activity. 

2. The polypeptide of claim 1, having an amino acid sequence which has at least 
20 65% identity with amino acids 20 to 464 of SEQ ID NO. 2. 

3. The polypeptide of claim 2, having an amino acid sequence which has at least 
70% identity with amino acids 20 to 464 of SEQ ID NO. 2. 

25 4. The polypeptide of claim 3, having an amino acid sequence which has at least 
80% identity with amino acids 20 to 464 of SEQ ID NO. 2. 

5. The polypeptide of claim 4, having an amino acid sequence which has at least 
90% identity with amino acids 20 to 464 of SEQ ID NO. 2. 

30 

6. The polypeptide of claim 5, having an amino acid sequence which has at least 
95% identity with amino acids 20 to 464 of SEQ ID NO. 2. 
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7. The polypeptide of any of claims 1-6, comprising the amino acid sequence of 
SEQIDNO.2. 

8. The polypeptide of any of claims 1-7, consisting of the amino acid sequence of 
5 SEQ ID NO. 2 or a fragment thereof 

9. The polypeptide of claim 8, consisting of the amino acid sequence of SEQ ID 
NO. 2. 

10 10. The polypeptide of claim 9, which consists of amino acids 20 to 464 of SEQ ID 
NO. 2. 

11. The polypeptide of claim 1, which is encoded by a nucleic acid sequence which 
hybridizes under low stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 

15 NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 
(iii) a subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) a complementary strand 
of(i),(ii),or(m> 

12. The polypeptide of claim 1 1, which is encoded by a nucleic acid sequence which 
20 hybridizes under low stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 

NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 
or (iii) a complementary strand of (i) or (ii). 

13. The polypeptide of claim 1, which is encoded by a nucleic acid sequence which 
25 hybridizes under medium stringency conditions with (i) nucleotides 568 to 2045 of SEQ 

ID NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 
1, (iii) a subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) a complementary 
strand of(i),(ii), or (iii). 

30 14. The polypeptide of claim 13, which is encoded by a nucleic acid sequence which 
hybridizes under medium stringency conditions with (0 nucleotides 568 to 2045 of SEQ 
ID NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 
1 , or (iii) a complementary strand of (i) or (ii). 
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15. The polypeptide of claim 1, which is encoded by a nucleic acid sequence which 
hybridizes under high stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 
NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 

5 (iii) a subsequence of® or (ii) of at least 100 nucleotides, or (iv) a complementary strand 
of(i),(ii), or (iii). 

16. The polypeptide of claim 15, which is encoded by a nucleic acid sequence which 
hybridizes under high stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 

10 NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 
or (iii) a complementary strand of (i) or (ii). 

17. The polypeptide of claim 1, wherein the polypeptide is a variant of the 
polypeptide having an amino acid sequence of SEQ ID NO. 2 comprising a substitution, 

15 deletion, and/or insertion of one or more amino acids. 

18. The polypeptide of claim 1, which is encoded by the nucleic acid sequence 
contained in plasmid pPH6 which is contained in E. coli NRRL B-30142. 

20 19. The polypeptide of any of claims 1-18 which has at least 20% of the 
phospholipase B activity of SEQ ID NO. 2. 

20. A polypeptide having the same phospholipase B activity as the polypeptide of any 
of claims 1*19. 

25 

21. An isolated nucleic acid sequence comprising a nucleic acid sequence which 
encodes the polypeptide of any of claims 1-20. 

22. An isolated nucleic acid sequence comprising a nucleic acid sequence having at 
30 least one mutation in the mature polypeptide coding sequence of SEQ ED NO; 1 , in which 

the mutant nucleic acid sequence encodes a polypeptide consisting of amino acids 20 to 
464ofSEQIDN0.2. 
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23. An isolated nucleic acid sequence produced by (a) hybridizing a DNA under low 
stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID NO. 1, (ii) die cDNA 
sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, (iii) a subsequence of 
(i) or (ii) of at least 100 nucleotides, or (iv) a complementary strand of (i)„ (ii), or (iii); 

5 and (b) isolating the nucleic acid sequence. 

24. The isolated nucleic acid sequence of claim 23 produced by (a) hybridizing a 
DNA under medium stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID 
NO. 1, (ii) the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, 

10 (iii) a subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) a complementary strand 
of (i) , (ii), or (iii); and (b) isolating the nucleic acid sequence. 

25. An isolated nucleic acid sequence of claim 24 produced by (a) hybridizing a DNA 
under high stringency conditions with (i) nucleotides 568 to 2045 of SEQ ID NO. 1, (ii) 

15 the cDNA sequence contained in nucleotides 568 to 2045 of SEQ ID NO. 1, (iii) a 
subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) a complementary strand of 
(i), (ii), or (iii); and (b) isolating the nucleic acid sequence. 


20 


25 


26. A nucleic acid construct comprising the nucleic acid sequence of claim 21 
operably linked to one or more control sequences that direct the production of the 
polypeptide in a suitable expression host 

27. A recombinant expression vector comprising the nucleic acid construct of claim 
26. 

28. A recombinant host cell comprising the nucleic acid construct of claim 26. 


29. A method for producing a mutant nucleic acid sequence, comprising (a) 
introducing at least one mutation into the mature polypeptide coding sequence of SEQ ID 
30 NO. 1, wherein the mutant nucleic acid sequence encodes a polypeptide consisting of 
amino acids 20 to 464 of SEQ ID NO. 2; and (b) recovering the mutant nucleic acid 
sequence. 


-55- 


NZNA-0013487 


10 


WO 01/29222 PCI7US00/29140 

30. A mutant nucleic acid sequence produced by the method of claim 29. 

31. A method for producing a polypeptide, comprising (a) cultivating a strain 
comprising the mutant nucleic acid sequence of claim 30 encoding the polypeptide to 

5 produce a supernatant comprising the polypeptide; and (b) recovering the polypeptide. 

32. A method for producing the polypeptide of any of claims 1-20 comprising (a) 
cultivating a strain to produce a supernatant comprising the polypeptide; and (b) 
recovering the polypeptide. 

33. A method for producing the polypeptide of any of claims 1-20 comprising (a) 
cultivating a host cell comprising a nucleic acid construct comprising a nucleic acid 
sequence encoding the polypeptide under conditions suitable for production of the 
polypeptide; and (b) recovering the polypeptide. 

15 

34. A method for producing a polypeptide comprising (a) cultivating a host cell under 
conditions conducive for production of the polypeptide, wherein the host cell comprises a 
mutant nucleic acid sequence having at least one mutation in the mature polypeptide 
coding sequence of SEQ ID NO. 1, wherein the mutant nucleic acid sequence encodes a 

20 polypeptide consisting of amino acids 20 to 464 of SEQ ID NO. 2, and (b) recovering the 
polypeptide. 

35. A method for producing the polypeptide of any of claims 1-20 comprising (a) 
cultivating a homologously recombinant cell, having incorporated therein a new 

25 transcription unit comprising a regulatory sequence, an exon, and/or a splice donor site 
operably linked to a second exon of an endogenous nucleic acid sequence encoding the 
polypeptide, under conditions conducive for production of the polypeptide; and (b) 
recovering the polypeptide. 

30 36. A method for producing a mutant of a cell, which comprises disrupting or 
deleting a nucleic acid sequence encoding the polypeptide of any of claims 1-20 or a 
control sequence thereof which results in the mutant producing less of the polypeptide 
than die cell. 
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37. A mutant produced by the method of claim 36. 

38. The mutant of claim 37, which further comprises a nucleic acid sequence 
5 encoding a heterologous protein. 

39. A method for producing a heterologous polypeptide comprising (a) cultivating the 
mutant of claim 38 under conditions conducive for production of the polypeptide; and (b) 
recovering the polypeptide. 

10 

40. A nucleic acid construct comprising a gene encoding a protein operably linked to 
a nucleic acid sequence encoding a signal peptide consisting of nucleotides 510 to 567 of 
SEQ ID NO. 1, wherein the gene is foreign to the nucleic acid sequence. 

15 41. A recombinant expression vector comprising the nucleic acid construct of claim 
40. 

42. A recombinant host cell comprising the nucleic acid construct of claim 40. 

20 43. A method for producing a protein comprising (a) cultivating the recombinant host 
cell of claim 42 under conditions suitable for production of the protein; and (b) 
recovering the protein. 

44. A method for reducing the phospholipid content in an edible oil, comprising 
25 treating the oil with the polypeptide of claim 1 to hydrolyze the phospholipid and 

separating the hydrolyzed phospholipid from the oil. 

45. A method for reducing the phospholipid content of an aqueous carbohydrate 
solution or slurry, comprising treating the aqueous carbohydrate solution or shiny with 

30 the polypeptide of claim 1 to hydrolyze the phospholipid and separating the hydrolyzed 
phospholipid from the aqueous carbohydrate solution or slurry. 

46. A method for preparing a dough, comprising incorporating into the dough an 

- 57 - 


NZNA-00 13489 


WO 01/29222 PCT/DSOO/29140 

effective amount of the polypeptide of claim 1 to improve one or more properties of the 
dough and/or a baked product obtained from the dough. 

47. The method of claim 46, wherein the one or more improved properties are 
5 selected from the group consisting of increased strength of the dough, increased stability 

of the dough, reduced stickiness of the dough, improved machinability of the dough, 
increased volume of the baked product, improved crumb structure of the baked product, 
improved softness of the baked product, improved flavor of the baked product, and 
improved antistaling of the baked product 

10 

48. A method for preparing a baked product, comprising baking a dough produced by 
the method of claim 46 or 47 to produce a baked product 

49. A dougih product obtained from a dough prepared by the method of claim 46 or 
15 47. 

50. A baked product produced by the method of claim 48. 
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. SEQUENCE LISTING 

<110> Harris, Paul 

Brown, Kimberly M. 

<120> Polypeptides Having Phospholipase B 
Activity And Nucleic Acids Encoding Same 


<130> 5951.204-WO 

<140> To Be Assigned 
<141> 2000-10-20 

<150> 09/426,072 
<151> 1999-10-21 

<160> 4 


<170> PastSEQ for Windows Version 4.0 

<210> 1 
<211> 2318 
<212> DNA 

<213> Aspergillus oryzae 
<400> 1 

aattcctgga caacaatccc tttgagagtg atagtaaggg ggatgaactg agatgctaag ' 60 

ctcacgttct tcgccttggc gtaggctgcc ccatgtcttg gtttagtgct tctctccacc 120 

ctagtagctt agccattgct tgtccaatcc cattcccatc tcgcatcacg ctctatttat 180 

gaccaaggct gtcaaatgag agccccactc atgtggcctc cacttcatat tgttttcttt 240 

gtcgataaga ctgtcatccc gtctccggtg gcttccacga agaacgattt gtaaagatgg 300 

atagcgatag gctgcccttg gtcggtatga agccgaagtc cctcgtgcct cgatggatat 360 

gcgcttcaac ccgacgcaca gcgcacgtgg tatccgaatg ggacggaaag gtagctcctc 420 

cagtgggcta tataagcctt tccctggccg ggctcagact ccctagcaag cagcattgtt 480 

caagctcacg acctcaatcc gcgagtaaac atgaagtcca ccgctctgct tactggtctc 540 

ggcctcttgg cctctctcgg tcttgctagc cctgtcacgt ccgagtatac gagcgtgcga 600 

gaagcccctt tcggatacaa gcctggctcc aaggagtcca ttgagaactt gaaggacaag 660 

gtcgagaaca ttgtctggct tattctcgag aacaggtttg tgtgccctta cgatattcat 720 

atgtggaata ataaattcct caattcagct tgtggtatgt gaagacgagc actaacatat 780 

ggtccagatc cttcgataac attctgggag gcgtgcgccg ccaaggactg gacaacccga 840 

tcaacaacgg cccgttctgc aactacaaga atgcgagcga cccatcctcg ggcaagtact 900 

gtactcaggc caaggactat gattccgtgt tcaacgatcc agaccactcc gtgactggta 960 

ataacttgga gttctacgga acttacaccc caaacaatgg tgcgattgcc agtggcaagg 1020 

tcgtcgccga ccagtctggc ttcctcaacg cacagcttaa cgactacccc aaactggccc 1080 

cagaagaggc gacaaggcaa gtgatgggat actatacgga ggaggaggtt cctacgctcg 1140 

tggaccttgt ggatgagttc actactttca acagctggtt ctcgtgtgtt cctggcgtaa 1200 

gtgcgataaa tctgtatatt tttaattttg actaacgcgt tcaaaagcct accaacccca 1260 

accgcttgtg cgctctggca ggaaccgctg ctgggcatgg caagaatgac gatgacttcc 1320 

tgaactatgg tatctctagc aagtccatct tcgaggccgc caacgagaag ggcgtgtcct 1380 

ggctcaacta cgatggcacc aacggagaat tcgaaccgga ttctctcttc ttcacctacg 1440 

tcaaccagac ctcccggtcc aacgtggtgc ccgttgaaaa cttcttccaa gacgcctacc 1500 

tcggtgtcct ccctaaattc tcttacatta acccctcctg ctgcggcacc aacaccaact 1560 

ccatgcaccc caccggtaac gtctcctacg gtgaggtctt cgtcaagcag atctatgatg 1620 

ccattcgcca gggccctcag tgggacaaga ccctgctctt cattacctac gacgagaccg 1680 

gtggcttcta cgaccatgtc cctccccctc tcgccgtccg cccggacaac ctgacctaca 1740 

ctgagactgc gaagaacggt cagaaataca ctcttcactt cgaccgtctg ggtggccgca 1800 

tgccgacctg ggttatctcc ccttacagta agaagggata catcgagcag tacggaacgg 1860 

atcccgtcac gggcaagccc gctccctaca gtgctacctc cgtcctcaag actctcggat 1920 
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atctctggga catcgaggac ttcacccctc gtgtcgccca ctctccatct ttcgatcacc 1980 

tgatcggcac gactttgcgt gaggatgctc ctattgctct caagactccc catacctttt 2040 

cggtataagt . ctcagtccgt gcagtgagac aggattaaaa gtgatgatga acgttctgac 2100 

ttcagtgaac gattacatgt tatagagcat tgttttgctt atagctacgc ctagagcgag 2160 

cgcgatgatg ataagataaa gctgggttat ttctctattg tatattcatt aatgaaagac 2220 

tttgataaca tgggatttaa aaaggaaatg ttttcttgca caatcaactc acggaacagg 2280 

cgtaacttta cgatgactgc accccggata cattaatt 2318 

<210> 2 
<211> 464 
<212> PRT 

<213> Aspergillus oryzae 
<400> 2 

Met Lys Ser Thr Ala Leu Leu Thr Gly Leu Gly Leu Leu Ala Ser Leu 

1 5 10 15 

Gly Leu Ala Ser Pro Val Thr Ser Glu Tyr Thr Ser Val Arg Glu Ala 

20 25 30 

Pro Phe Gly Tyr Lys Pro Gly Ser Lys Glu Ser He Glu Asn Leu Lys 

35 40 45 

Asp Lys Val Glu Asn lie Val Trp Leu lie Leu Glu Asn Arg Ser Phe 

50 55 60 

Asp Asn He Leu Gly Gly Val Arg Arg Gin Gly Leu Asp Asn Pro He 

65 70 75 80 

Asn Asn Gly Pro Phe Cys Asn Tyr Lys Asn Ala Ser Asp Pro Ser Ser 

85 90 95 

Gly Lys Tyr Cys Thr Gin Ala Lys Asp Tyr Asp Ser Val Phe Asn Asp 

100 105 no 

Pro Asp His Ser Val Thr Gly Asn Asn Leu Glu Phe Tyr Gly Thr Tyr 

115 120 125 

Thr Pro Asn Asn Gly Ala He Ala Ser Gly Lys Val Val Ala Asp Gin 

130 135 140 

Ser Gly Phe Leu Asn Ala Gin Leu Asn Asp Tyr Pro Lys Leu Ala Pro 
145 150 155 160 

Glu Glu Ala Thr Arg Gin Val Met Gly Tyr Tyr Thr Glu Glu Glu Val 

165 170 175 

Pro Thr Leu Val Asp Leu Val Asp Glu Phe Thr Thr Phe Asn Ser Trp 

180 185 190 

Phe Ser Cys Val Pro Gly Pro Thr Asn Pro Asn Arg Leu Cys Ala Leu 

195 200 205 

Ala Gly Thr Ala Ala Gly His Gly Lys Asn Asp Asp Asp Phe Leu Asn 

210 215 220 

Tyr Gly He Ser Ser Lys Ser He Phe Glu Ala Ala Asn Glu Lye Gly 
225 230 235 240 

Val Ser Trp Leu Asn Tyr Asp Gly Thr Asn Gly Glu Phe Glu Pro Asp 

245 250 255 

Ser Leu Phe Phe Thr Tyr Val Asn Gin Thr Ser Arg Ser Asn Val Val 

260 265 270 

Pro Val Glu Asn Phe Phe Gin Asp Ala Tyr- Leu Gly Val Leu Pro Lys 

275 280 285 

Phe Ser Tyr He Asn Pro Ser Cys Cys Gly Thr Asn Thr Asn Ser Met 

290 295 300 

His Pro Thr Gly Asn Val Ser Tyr Gly Glu Val Phe Val Lys Gin He 
305 310 315 320 

Tyr Asp Ala He Arg Gin Gly Pro Gin Trp Asp Lys Thr Leu Leu Phe 

325 330 335 

He Thr Tyr Asp Glu Thr Gly Gly Phe Tyr Asp His Val Pro Pro Pro 

340 345 350 

Leu Ala Val Arg Pro Asp Asn Leu Thr Tyr Thr Glu Thr Ala Lys Asn 
355 360 365 
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Gly Gin Lys Tyr Thr Leu His Phe Asp Arg Leu Gly Gly Arg Met Pro 

370 375 380 

Thr Trp Val He Ser Pro Tyr Ser Lys Lys Gly Tyr He Glu Gin Tyr 
385 390 395 40a 

Gly Thr Asp Pro Val Thr Gly Lys Pro Ala Pro Tyr Ser Ala Thr Ser 

405 410 415 

Val Leu Lys Thr Leu Gly Tyr Leu Trp Asp He Glu Asp Phe Thr Pro 

420 425 430 

Arg Val Ala His Ser Pro Ser Phe Asp His Leu He Gly Thr Thr Leu 

435 440 445 

Arg Glu Asp Ala Pro He Ala Leu Lys Thr Pro His Thr Phe Ser Val 
450 455 460 

<210> 3 
<211> 27 
<212> DNA 

<213> Aspergillus oryzae 
<400> 3 

atttaaatga tgaagtccac cgctctg 27 

<210> 4 
<211> 30 
<212> DNA 

<213> Aspergillus oryzae 
<400> 4 

gttaattaaa cttataccga aaaggtatgg 30 
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